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GENETIC ANALYSIS 



5 INTRODUCTION 

Limitations of the current approaches 

There are a number of limitations to carrying out association 
studies using single nucleotide polymorphisms (SNPs) and linkage 
io disequilibrium within human populations (see Science, Vol 278, p1580, 
(1997) for a review of such methods). We have no control over 
recombination frequency around a given locus or over past human genetic 
crossing. Some mutations will be closely correlated with nearby SNPs and 
others will not. 

15 

The need for whole genome analysis 

With the SNP and linkage disequilibrium approach (and many 
others), markers are essentially used as a surrogate for sequencing - the 
more markers, the better. The logical endpoint of the above argument is to 

20 look at every base in the human genome - and carry out what could be 
termed a whole genome association study. In essence, the sequence at 
every base would be determined for the genome of each member of a 
phenotypicaliy 'affected 1 and a phenotypically 'unaffected' population. 
Statistical correlations (associations) could then be drawn between 

25 sequence differences and phenotype. Such associations would have 

future predictive values for the phenotype of interest, knowing the genotype 
and could be of great value in medicine and pharmacogenetics. 

The current invention selectively enriches for DNA fragments 
that determine phenotype in the 'affected' population and thus makes the 

30 prospect of carrying out whole genome association studies for humans and 
other species a very real possibility. 
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Definition of terms used with the current invention 

Within the scope of the current invention, the individuals 
chosen for whole genome analysis may be human, animal or plant and 
they may be eukaryotic, prokaryotic or archaebacterial in origin. 

5 The terms 'affected' and 'unaffected' are used without 

limitation r order to categorise individuals into two groups - those that 
possess a defined phenotype of interest ('affected' individuals) and those 
that do not possess the phenotype of interest ('unaffected' individuals). 
The phenotype common to the 'affected' individuals may be either 

10 beneficial (e.g. for these individuals, a particular pharmaceutical entity 
might show high efficacy in a phase II clinical trial) or detrimental (e.g. for 
these individuals, a particular pharmaceutical entity might show adverse 
toxicology in a phase I clinical trial). 

The 'affected' population may comprise one or more 

15 individuals and the 'unaffected' population may similarly comprise one or 
more individuals according to the particular embodiment of the invention 
(see below). 

The term DNA is used throughout for simplicity. Within the 
scope of the current invention, the term DNA may equally well apply to all 

20 or part of the haploid, diploid or polyploid genomic DNA content of one or 
more germ line or somatic cell(s). The DNA may be extracted from cells 
taken directly from the individual(s), the DNA may be extracted from cells 
cultured or immortalised from the individual(s) or the DNA may be prepared 
from a library of clones - with inserts derived from the individual(s) and 

25 propagated in some appropriate host and cloning vector system. For the 
particular case wherein the term DNA refers to the expressed part of the 
haploid, diploid or polyploid genomic DNA content of one or more somatic 
cells and the DNA is prepared from a library of clones - with inserts derived 
from the individual(s) and propagated in some appropriate host and cloning 

30 vector system, a cDNA library (normalised or otherwise) may be used. 
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In the current invention, DNA is compared in fragmented 
form. Fragmentation can be performed after DNA extraction, prior to 
cloning and/or after cloning. Restriction enzyme digestion is the preferred 
method for such fragmentation - though other methods (e.g. shearing or 
5 sonication) will be obvious to those skilled in the art. 

For the particular case wherein the DNA is prepared from a 
library of clones (either genomic clones or cDNA clones) - with inserts 
derived from the individual(s) and propagated in some appropriate host and 
cloning vector system and wherein restriction enzyme fragmentation is 

10 used prior to cloning, polymerase chain reaction amplification can be used 
to prepare the DNA for comparison in fragmented form. Priming sites 
within the vector sequence flanking the cloned restriction enzyme 
fragmented inserts may be usefully employed for one or more cycles of 
polymerase chain reaction amplification of the fragmented DNA of interest. 

15 The primers used for polymerase chain reaction amplification of the 
fragmented DNA of interest could again be used after the phenotype- 
determining fragment enrichment process to 'rescue' and clone the 
enriched fragments. 

Within the scope of the current invention, the terms 

20 biotinylation and streptavidin capture are used both as an example and as 
the currently preferred embodiment for the invention. The streptavidin may 
be surface attached to inert particles (magnetic or otherwise) or to vessel 
walls (e.g. microtitre plate wells). The biotin may be introduced via a 
deoxynucleotide triphosphate analogue using a polymerase; by using a 

25 biotin-conjugated primer and polymerase chain reaction amplification; 
chemically or photochemically. The use of biotin and streptavidin is not a 
limitation for the invention. The invention could equally be used with other 
high affinity capture systems well known to those skilled in the art (e.g. 'his 
tag' introduction and metal ion affinity capture). 

30 Within the scope of the current invention, the term 'abnormal' 

- used with respect to the term 'normal* - is used without limitation in order 
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to denote a somatic cell (or somatic cells) with a discemable phenotypic 
characteristic (or characteristics) arising from the acquisition of a different 
somatic mutation (or set of somatic mutations) from that (or those) seen in 
the 'normal' counterpart. Cells will most usually be considered 'abnormal 1 
5 with respect to their 'normal' counterparts through the acquisition of a 
different somatic mutation (or set of somatic mutations) leading to one or 
more of the following phenotypic characteristics: altered marker gene 
expression, altered genomic organisation, growth under certain selective 
culture conditions, immortalised growth in culture, unrestrained growth in 

10 vivo or in vitro, failure of normal apoptotic control mechanisms in vivo or in 
vitro, induction of neovascularisation, escape of cells across epithelium, 
migratory cell survival or metastasis. 

Within the scope of the current invention, the term mismatch 
recognition protein is used without limitation to denote a protein of 

is eukaryotic, prokaryotic or archaebacterial origin capable of the selective 
recognition of (and binding to) a DNA duplex that is not perfectly matched 
along its entire length. Recognition of (and binding to) will be preferably for 
bases that are not engaged in correct Watson and Crick pairing and for 
small deletions or insertions. Many such proteins are known to those 

20 skilled in the art. Prokaryotic and eukaryotic mutS homologues, phage T4 
endonuclease VII, phage T7 endonuclease I and the plant enzyme CEL-t 
are just some examples. 

Inter-population perfectly matched duplex depletion 

25 In the inter-population perfectly matched duplex depletion 

approach, we compare (in fragmented form) the pooled DNA of 'affected 1 
individuals with the pooled DNA of 'unaffected' individuals (both from 
populations as outbred and otherwise similar to each other as possible). 
We are only interested in those regions where differences occur between 

30 'affected' and 'unaffected' DNA molecules. For populations as above, the 
only prevalent sequence differences within the 'affected' pool (compared to 
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the 'unaffected' pool) should be somewhere within the gene(s) (using the 
term gene in its widest sense to include exons, introns and all associated 
upstream and downstream regulatory sequences) that actually 
determine(s) their common phenotype. This means that we are no longer 
5 tied into working with rare (and perhaps atypical) populations where there 
is high genetic homogeneity. 

Pooling the DNA from entire phenotypically-defined 
populations massively reduces the amount of labour involved. 

10 DNA sequence variation in populations 

Nickerson et al, Nature Genetics, Vol 19, p233 (1998), sequenced 9.7 kb of 
the lipoprotein lipase gene from 71 individuals (24 African-Americans, 24 
Europeans and 23 European-Americans). This gene is fairly typical (90 % 
intron and 10 % exon - total size 30 kb with 10 exons). 

15 88 sequence variants were found (i.e. one per 1 10 bp on 

average). Most variations were found in non-coding sequence. 90 % of 
these were SNPs (60 % of which were transitions and 40 % were 
transversions). All of the SNPs were biallelic. 10 % of the sequence 
variants were insertions or deletions at repeat sequences. 

20 58 % of the sequence variants were present in all three ethnic 

populations. Half of these were found in heterozygous form and half in 
homozygous form. 

Nucleotide diversity (defined as the expected number of 
nucleotide differences per site between a random pair of chromosomes 

25 drawn from the population) is 1/500 for DNA in general and 1/2,000 for 
coding sequence DNA. This means that, on average, any two DNA 
fragments annealed together from such a population will contain a 
mismatch every 500 bases. 

DNA sequence variants are therefore very common. They 

30 are not, however, totally random - the variants that occur every 500 bases 
or so are limited; they are generally biallelic at just that single base. It is 
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this fact that the inter-population perfectly matched duplex depletion 
approach selectively exploits. 

Statistical analysis of the inter-population perfectly matched duplex 
5 depletion fragmentation process 

If the length of the DNA fragments is F bases and the 
average length between sequence differences between any two DNA 
molecules is 500 bases, the probability that a hybrid duplex between any 
two random DNA fragments will contain no mismatches is given by 

10 

Pr(0) = e-< F/500 > 

and the probability that a hybrid duplex between any two random DNA 
molecules will contain any mismatches is given by 

15 

Pr(>1) = {1-(e* (F/500) )} 

Example values for Pr(0) and Pr(>1) for different values of F 
are given below 



F 


Pr{0) 


Pr(>1) 


10 


0.98 


0.02 


20 


0.96 


0.04 


50 


0.90 


0.10 


100 


0.82 


0.18 


200 


0.67 


0.33 


300 


0.55 


0.45 


400 


0.45 


0.55 


500 


0.37 


0.63 


600 


0.30 


0.70 


700 


0.25 


0.75 


800 


0.20 


0.80 


900 


0.17 


0.83 


1000 


0.14 


0.86 
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Example average restriction fragment sizes for DNA digestion with six 
6 bp cutters and up to four 4 bp cutters 

Each 6 bp cutter will cut DNA every 4.096 bp on average and 
each 4 bp cutter will cut DNA every 256 bp on average. 
5 For a given set of A 6 bp cutters and B 4 bp cutters (with no 

duplication of restriction enzyme cutting sequence), the average fragment 
length F will be 

{(A/4,096)+(B/256)}' 1 

io 

Example values for F as A and B are varied are given in the 
following table 



A 


B 


average size 
(F) 


6 


0 


683 


6 


1 


186 


6 


2 


108 


6 


3 


76 


6 


4 


59 



1 5 We should note that the above situation assumes that none of 

the 4 bp cutter recognition sites lie within any of the 6 bp cutter recognition 
sites. If, for example, we have a 4 bp cutter recognition site nested within a 
6 bp cutter recognition site (e.g. from the use of Mbolanti BamHI in the 
fragmentation), then we should reduce the value of A from 6 to 5 in the 

20 above. 

In general, if we have a given set of A enzymes that cut DNA 
every a bp on average, B enzymes that cut DNA every b bp on average, 
... , Z enzymes that cut DNA every z bp on average (with no duplication of 
restriction enzyme cutting sequence), the average fragment length F will be 

25 

{(A/a)+(B/b)+... + (Z/z)}- 1 
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Example sets of 6 bp cutters and 4 bp cutters for fragmentation 

Example sets of 6 bp cutters and 4 bp cutters that contain 
panels of six 6 bp cutters that are compatible with terminal restriction site 
profiling array (TRSPA) analysis (see below) are given in the following: 

Example enzyme set 1 

For restriction in M buffer + BSA 



Number 


Enzyme 


Site 


M+BSA 

0/ 
/o 

activity 


Optimum 

•c 


\Jf\i\ 


Supplier 


Inactivate 
°C/min 


1 


Mboi 


GATC 


100 


37 


25 


NEB 


65/20 


2 


Haelff 


GGCC 


100 


37 


50 


NEB 


80/20 


3 


Msel 


TTAA 


100 


37 


20 


NEB 


65/20 



Number 


Enzyme 


Site 


M+BSA 

% 
activity 


Optimum 
°C 


U/fil 


Supplier 


Inactivate 
°C/min 


1 


BamHI 


GGATCC 


100 


37 


100 


NEB 


80/20 


2 


BsrGI 


TGTACA 


100 


37 


10 


NEB 


80/20 


3 


HindUI 


AAGCTT 


100 


37 


>40 


APB 


65/20 


4 


Ncol 


CCATGG 


100 


37 


50 


NEB 


65/20 


5 


Spel 


ACTAGT 


100 


37 


50 


NEB 


65/20 


6 


Aflll 


CTTAAG 


100 


37 


10 


APB 


60/15 



10 

Example enzyme set 2 

For restriction in M buffer + BSA 



Number 


Enzyme 


Site 


M+BSA 

% 
activity 


Optimum 

C C 


U/^il 


Supplier 


Inactivate 
°C/min 


1 


Mbof 


GATC 


100 


37 


25 


NEB 


65/20 


2 


Haelll 


GGCC 


100 


37 


50 


NEB 


80/20 


3 


Msel 


TTAA 


100 


37 


20 


NEB 


65/20 
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Number 


Enzvmp 


Site 


M+BSA 

% 

activity 


Optimum 
Z C 


U / ul 


Supplier 


Inactivate 
•C/min 


1 


EcoRl 


GAATTC 


100 


37 


>40 


APB 


65/20 


2 


BspHI 


TCATGA 


100 


37 


10 


APB 


65/20 


3 


Bglli 


AGATCT 


75 


37 


>40 


APB 


No 


4 


Xbaf 


TCTAGA 


100 


37 


100 


NEB 


65/20 


5 


Acc65l 


GGTACC 


75 


37 


10 


NEB 


65/20 


6 


ApaLI 


GTGCAC 


100 


37 


10 


NEB 


No 



Aspect (I) 

In one aspect the invention provides a method of providing a 
mixture of DNA fragments enriched in fragments that are characteristic of a 
5 phenotype of interest, by providing affected DNA in fragmented form and 
providing unaffected DNA in fragmented form, which method comprises: 

a) mixing the fragments of the affected DNA and the fragments 
of the unaffected DNA under hybridising conditions; 

b) recovering a mixture of hybrids that contain mismatches; 

10 c) recovering fragments of the affected DNA from the mixture of 

hybrids that contain mismatches; 

and optionally repeating steps a), b) and c) one or more 

times. 

is Inter-population mismatch-containing duplex selection 

'Affected' versus 'unaffected' (i.e. inter-population) mismatch- 
containing duplex selection can be achieved by attaching a mismatch- 
binding protein to a solid support (or using the mismatch-binding protein in 
solution followed by subsequent solid-phase capture), taking fragmented 
20 and denatured 'affected' DNA and hybridising this to an excess of 

fragmented, denatured and biotinylated 'unaffected' DNA with ensuing 
capture of mismatch-containing duplex molecules. Releasing the 
mismatch-containing duplex molecules without denaturation, streptavidin 
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capture and then release of the non-biotinylated strands will give only the 
desired species as shown below. 

Method 

5 Fragment the 'affected' DNA. Fragment and derivatise the 

'unaffected' DNA with biotin. Only DNA from this population will be 
streptavidin captured. Melt and anneal to give 



biotin * biotin 

biotin Biotin v 

biotin * biotin 



io Mismatch-binding protein select. Capture only the mismatch- 

containing duplexes. Release without denaturation to give 



a biotin 

biotin v 

^ biotin 



v 
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Streptavidin capture. to give 

a biotin 

biotin V 

^ biotin 



Release the non-biotinylated strands to give 



biotin 



Repeat as necessary. 

Repetition of the above sequence of reactions will lead to 
inter-population perfectly matched duplex depletion. 

io 

What will be purified ? 

Inter-population mismatch-containing duplex selection as 
above ensures that all of the various phenotype-determining fragments 
(unique to the 'affected 1 population) are captured for subsequent analysis - 
15 but it also causes the co-purification of very many SNP-containing ('noise 1 ) 
fragments. 

We now need to consider the fate of the various types of 
fragment (i.e. those that determine the phenotype and those that do not) as 
we carry out inter-population perfectly matched duplex depletion cycles as 
20 above. 
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Recovery of 'affected' DNA molecules after streptavidin capture 

For a particular fragment, if we have X molecules of 'affected' 
DNA and Y molecules of 'unaffected 1 biotinylated DNA, after complete 
hybridisation, there will be a ratio of {Y/(X+Y)} streptavidin-capturable 
5 molecules to {X/(X+Y)} streptavidin-non-capturable molecules. We can 
thus manipulate the yield of streptavidin-capturable hybrids by varying X 
and Y. 

Example recovery and loss figures for various X and Y are 
shown in the following table 

10 



ratio 
(Y/X) 


recovery 
= {Y/(X + Y)} 


Loss 
= (X/(X+Y)} 


1 


50% 


50% 


2 


67% 


33% 


3 


75% 


25% 


4 


80% 


20% 


9 


90% 


10% 


19 


95% 


5% 


99 


99% 


1 % 


999 


99.9 % 


0.1 % 



After n cycles, the recovery for a phenotype-determining 
fragment will be given by {Y/(X+Y)} n 

15 Loss of general SNP-containing ('noise') fragments during inter- 
population perfectly matched duplex depletion cycles 

If we anneal fragmented DNA molecules together and capture 
only the mismatch-containing duplexes, then n repetitions of such a 
process will reduce the original number of fragments to the following 
20 fraction 



{1-(e <F/500> )} n -{Y/(X+Y)} n 
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The enrichment for phenotype-determining fragments over 
SNP-containing ('noise') fragments during inter-population perfectly 
matched duplex depletion cycles will therefore be given by 

Example figures for enrichment are given below with F = 50, 
1 00, 200. 300, 400, 500, 600, 700, 800, 900 and 1 ,000 and n = 1 , 2, 3, 4, 5, 
6 ,7, 8, 9 and 10. 

io 





n = 


F 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


50 


11 


110 


1,160 


12,194 


128,135 


1.346,489 


1.E+07 


1.E+08 


2.E+09 


2.E+10 


100 


6 


30 


168 


926 


5,110 


28.187 


155.500 


857,841 


4,732,415 


3.E+07 


200 


3 


9 


28 


85 


257 


779 


2.362 


7,166 


21,735 


65,929 


300 


2 


5 


11 


24 


53 


119 


263 


582 


1,291 


2,860 


400 


2 


3 


6 


11 


20 


36 


65 


118 


215 


390 


500 


2 


3 


4 


6 


10 


16 


25 


39 


62 


98 


600 




2 


3 


4 


6 


9 


12 


18 


25 


36 


700 




2 


2 


3 


4 


5 


7 


10 


13 


17 


800 




2 


2 


2 


3 


4 


5 


6 


8 


10 


900 




1 


2 


2 


2 


3 


4 


4 


5 


6 


1.000 




1 


2 


2 


2 


2 


3 


3 


4 


4 



Loss of specific SNP-containing ('noise') fragments during inter- 
population perfectly matched duplex depletion cycles 

The non-polymorphic and SNP-containing ('noise') fragments 
15 will be depleted as described above. 

Not all fragments will, however, be depleted at the same rate. 
An individual SNP-containing ('noise') fragment will be depleted with every 
cycle of inter-population perfectly matched duplex depletion as outlined 
below. 

20 Let us assume that there are two alleles for the polymorphism 

within a particular fragment. Let these be called P and Q. Let p be the 
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fraction of the P allele in the (outbred) 'affected' and 'unaffected' 
populations and let q be the fraction of the Q allele. Let (p+q)=1 , so that 

q=(1-p). 

After denaturation and annealing, the four possible events are 
5 PP, PQ, QP and QQ. PP and QQ will form perfectly matched duplexes and 
will therefore be lost - whereas PQ and QP will form mismatch-containing 
duplexes and will consequently be recovered. 

After one cycle, the fraction of recovered molecules will be 

I0 {2pq/(p 2 +2pq+q 2 )} 



= {2pq/((p+q) 2 )} 
= 2pq 

15 

= 2p(1-p) 

Hence if we start out with M molecules of DNA from the 
population, there will be 2Mpq molecules remaining after the first round of 
20 inter-population mismatch-containing duplex selection. Let us denote the 
number of molecules entering the second round of inter-population 
mismatch-containing duplex selection by M\ where 



M' = 2Mpq 

25 

and the fraction of lost molecules will be 



{(p 2 + q 2 )/(p 2 +2pq+q 2 )} 



30 



= {(p 2 +q 2 )/((p+q) 2 )} 
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2 2 

= P +q 

= p 2 +{1-2p+p 2 } 
5 = 1-2p+2p 2 

= 1-2p(1-p) 
=1-2pq 

10 

The fractional loss of P-allelic molecules will be p 2 and the 
fractional loss of Q-allelic molecules will be q 2 ( = (1-p) 2 ). 

If we start out with M molecules of DNA from the population, 
there will be Mp of the P-alleiic molecules and Mq = M(1-p) of the Q-allelic 
I5 molecules before inter-population mismatch-containing duplex selection. 
After inter-population mismatch-containing duplex selection, there will 
therefore be Mp-Mp 2 = Mp(1-p) = Mpq of the P-allelic molecules and Mq- 
Mq 2 = Mq(1-q) = Mqp of the Q-allelic molecules. In other words, after the 
first round of mismatch-containing duplex selection, there will be the same 
20 number of P-allelic molecules as Q-allelic molecules. 

We can define new allelic frequencies p' and q' as follows 

p' = q' = 0.5 

25 If we now perform a second round of inter-population 

mismatch-containing duplex selection, we start out with M' molecules of 
DNA from the first round. There will be M'p' of the P-allelic molecules and 
M'q J of the Q-allelic molecules before inter-population mismatch-containing 
duplex selection. 

30 After inter-population mismatch-containing duplex selection, 

there will therefore be M'p'-Mpp' = M'p'CI-p) = M'p'q of the P-allelic 
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molecules and M'q ? -Mqq ! = M'q : (1-q) = M'q ; p of the Q-allelic molecules. 
The total number of molecules (M") will be 

M'p'q + M'q'p 

5 

= M'(p ! q-q'p) 
but p* = q' = 0.5 
io hence M" = 0.5*M'(p+q) 

but (p+q) = 1 
so M" = (MV2) 

I5 

In other words, after the second round of inter-population 
mismatch-containing duplex selection, there will again be the same number 
of P-allelic molecules as Q-allelic molecules. Thus the new allelic 
frequencies p" and q" remain as previously p" = q" = 0.5. 

20 A pattern now emerges. After the first cycle of inter- 

population mismatch-containing duplex selection, the allelic frequencies are 
equalled and the number of molecules is reduced to 2Mpq (Mpq of each 
allelic molecule). Thereafter, every cycle halves the number of molecules 
and keeps both alleles at the same frequency. 

25 Consequently, after n cycles, the number of P-allelic 

molecules will be 

Mpq(0.5) n - 1 

30 and the number of Q allelic molecules will be 
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Mqp(0.5) n ' 1 



both are. of course, equal. 

If we now take the capture yield (see above) into 
consideration, the SNP-containing ('noise') fragment yield will be given by 



2Mpq(0.5) n " 1 -{Y/(X+Y)} n 



where both allelic variants are deemed to be captured l noise\ 

K) 

Polymorphisms that interfere with the pattern of restriction digestion 

For both the loss of general and specific SNP-containing 
('noise 5 ) fragments during inter-population perfectly matched duplex 
depletion cycles (described above) and where the SNP interferes with the 

15 pattern of restriction digestion, if the mismatch-binding protein also binds to 
duplex molecules with unequal lengths (e.g. from inter-population 
annealing around a site of restriction site polymorphism), then the above 
analysis still holds (with perfectly matched duplex being replaced by equal 
length duplex and mismatch-containing duplex being replaced by unequal 

20 partner-length duplex). 

In the rare cases where a restriction site is lost due to a 
sequence change that actually determines the phenotype of interest, a 
double-length fragment will be obtained. This will give rise to a double 
terminal restriction site profiling array (TRSPA) signature (see below). 

25 Multiple isolates of the particular double signature will be indicative of an 
association between the fragment and the phenotype of interest. 
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Further 'kinetic' enrichment to enhance the selective removal of SNP- 
containing noise' from the pool of phenotype determining fragments 

After multiple cycles of enrichment by the above procedure, 
the enriched DNA pool should contain many copies of all phenotype- 

5 determining fragments but also low numbers of copies of many different 
phenotype non-determining fragments. The total number of 'noise' 
fragments may exceed the number of phenotype determining fragments, 
despite the number of each individual 'noise' species being very small. The 
'noise' fragments would therefore increase the number of probes required 

io for TRSPA analysis before a pattern emerges. To largely eliminate this 
problem, a further kinetic enrichment procedure is used. Either one or both 
of strategies A and B below can be employed to achieve 'kinetic' 
enrichment. 

1 5 Strategy A - Subtraction of the enriched DNA from inter-population 
mismatch containing duplex depletion 

The enriched fragment pool from inter-population mismatch 
containing duplex depletion is rapidly self-hybridised - enabling the 
common phenotype-determining fragments to form perfectly matched 
20 duplexes with greater efficiency than the rare 'noise* fragments. Selection 
for perfectly matched duplexes then yields a selectively further enriched 
pool of fragments. Multiple cycles of subtraction could be carried out if 
necessary. 

25 Strategy B - Hybridisation of the enriched DNA from inter-population 
mismatch containing duplex depletion against the 'affected 1 DNA pool 

The enriched fragment pool from inter-population mismatch 
containing duplex depletion is then hybridised to an excess of biotinylated 
DNA from the 'affected' pool. This allows the common phenotype- 
30 determining fragments to form perfectly matched duplexes with greater 
efficiency than the rare 'noise* fragments. Selection for perfectly matched 
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duplexes followed by streptavidin capture and denaturation to release the 
non-biotinylated strands then yields a further enriched pool of fragments. 
Multiple such 'affected' pool back-hybridisations could be carried out if 
necessary. 

5 

Extension of the invention to the case of single phenotypically 
'affected' individuals within populations where the distinction 
between 'affected' and 'unaffected' is clear 

The above has described inter-population perfectly matched 
in duplex depletion between non-biotinylated DNA fragments from an 

'affected' population and biotinylated DNA fragments from an 'unaffected' 
population. Provided the 'unaffected' population is sufficiently complex that 
it contains all the non-phenotype-determining sequence variants found in a 
single 'affected 1 individual, then inter-population perfectly matched duplex 
15 depletion should be possible for single phenotypically 'affected' individuals 
against an 'unaffected' population where the distinction between 'affected' 
and 'unaffected 1 is clear. The latter proviso is needed in order to ensure 
that a small number of misdiagnosed 'affected' individuals in the 
'unaffected* population do not cause the removal of phenotype-determining 
:o fragments during inter-population perfectly matched duplex molecular 
depletion. 

Extension of the invention to the case of disease gene identification 
in cases where novel phenotype-determining mutations arise 
25 spontaneously within a family 

Except for a small number of sequence changes, each of us 
contains DNA sequence derived from our parents - our individuality 
resulting from precisely which parental alleles we receive. If one of the 
above small number of sequence changes results in a change in 
30 phenotype, then we can use inter-population perfectly matched duplex 
depletion to enrich for fragments encoding this change in phenotype. 
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If we take 'unaffected total ancestral' cells (by which we mean 
cells derived from a complete set of 'unaffected' ancestors - e.g. both 
parents, or mother plus two paternal grandparents, or father plus two 
maternal grandparents, or two maternal grandparents and two paternal 

5 grandparents etc. ) as the source of our biotinylated fragments and cells 
from an 'affected* descendent as the source of our non-biotinylated 
fragments, any fragments that have acquired phenotype-determining 
sequence changes between 'unaffected' ancestral generations and the 
'affected' descendent generation will be unable to form perfectly matched 

in duplexes with the biotinylated 'unaffected total ancestral' fragments. 
Successive cycles of such inter-population perfectly matched duplex 
depletion will thus lead to the enrichment of fragments carrying all such 
sequence - the degree of enrichment per cycle being as described below. 

15 Statistical analysis 

Let us assume that equal numbers of fragments are used 
from each of the 'unaffected' ancestors. Let the number of such ancestors 
be A. 

{1/A} of the annealings in the inter-population perfectly 
:o matched duplex depletion will be self-against-ancestral-transmitted alleles - 
statistically equivalent to self-against-self inter-population perfectly matched 
duplex depletion (see below). 

{(A-1)/A} of the annealings in the inter-population perfectly 
matched duplex depletion will be self-against-ancestral-nontransmitted 
25 alleles - statistically equivalent to inter-population perfectly matched duplex 
depletion between unrelated individuals. 



A 


{1/A} 


{(A-1)/A) 


2 


1/2 


V2 


3 


1/3 


2/3 


4 


1/4 
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From the data of Nickerson et al, a DNA sequence variation 
between unrelated individuals should occur every 500 base pairs (it is this 
figure that we use for inter-population perfectly matched duplex depletion 
with self-against-ancestral-nontransmitted alleles and inter-population 

5 perfectly matched duplex depletion between unrelated individuals). In 
addition, a given individual should be heterozygous once every 573 base 
pairs (this figure is used for inter-population perfectly matched duplex 
depletion with self-against-ancestral-transmitted alleles and self-against- 
self inter-population perfectly matched duplex molecular depletion). Inter- 

io population perfectly matched duplex depletion against the transmitted 
alleles and the nontransmitted alleles will now be considered separately. 

Self-against-ancestral-transmitted alleles inter-population perfectly 
matched duplex depletion 

15 If we anneal the two complementary strands for a fragment 

using DNA from an 'affected* descendent and DNA containing the 
transmitted alleles of 'unaffected' ancestors, then {1-e ( " F/573) } of fragments 
will contain one or more site of heterozygosity. {1/A} of the annealings will 
be of this type. For such annealings, where a site of heterozygosity is 

20 present, the probability of obtaining a mismatch-containing duplex between 
a biotinylated fragment and a non-biotinylated fragment containing the site 
of heterozygosity is 0.5. 

Self-against-ancestral-nontransmitted alleles inter-population 
25 perfectly matched duplex depletion 

If we anneal the two complementary strands for a fragment 
using DNA from an 'affected' descendent and DNA containing the 
nontransmitted alleles of 'unaffected' ancestors, then {1-e ( * F/500) } of 
fragments will contain one or more site of DNA sequence variation. {(A- 
30 1 )/A} annealings will be of this type. 
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Phenotype-determining fragment enrichment 

The fraction of fragments carried through the first cycle of 
inter-population perfectly matched duplex depletion will therefore be 

5 ([0.5-{1/AH1-e ( ' F/ ^ 

where {Y/(X+Y)} represents the yield of streptavidin- 
capturable molecules. 

Hence n repetitions of such a process will reduce the original 
it) number of fragments to the following fraction 

([0.5.{1/AHl-e ( - F/573) }]+[^ 



The enrichment for phenotype-determining fragments over 
!5 SNP-containing ('noise 1 ) fragments during inter-population perfectly 
matched duplex depletion cycles will therefore be given by 

([0.5-{1/AH1-e { - F ^ 

20 Example figures for enrichment are given for A = 2, 3 and 4 

below with F = 50, 100, 200, 300, 400, 500, 600, 700, 800, 900 and 1,000 
and n = 1 , 2, 3, 4 , 5 ,6 ,7, 8, 9 and 10. 
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For A=2 





n= 


F 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


50 


15 


213 


3,115 


45.494 


664.427 


9.703,694 


1.Et08 


2.E+09 


3.E+10 


4.'E+11 


100 


8 


59 


448 


3.430 


26.250 


200.887 


1.537,367 


1.E+07 


9.E+07 


7.E+08 


200 


4 


18 


74 


309 


1.296 


5,434 


22.782 


95.524 


400.520 


1,679,340 


300 


3 


9 


28 


87 


265 


811 


2,475 


7,558 


23,078 


70,468 


400 


2 


6 


16 


39 


97 


241 


600 


1.497 


3.734 


9,313 


500 


2 


5 


10 


22 


48 


103 


224 


485 


1,051 


2,277 


600 


2 


4 


7 


15 


29 


56 


109 


213 


416 


813 


700 


2 


3 


6 


11 


19 


35 


63 


114 


207 


374 


800 


2 


3 


5 


8 


14 


24 


42 


71 


121 


205 


900 


2 


3 


4 


7 


11 


18 


30 


49 


79 


128 


1,000 


2 


2 


4 


6 


9 


15 


23 


36 


57 


89 



For A=3 

5 





n= 


F 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


50 


13 


167 


2.154 


27,823 


359,340 


4,640,941 


6.E+07 


8.E+08 


1.E+10 


1.E+11 


100 


7 


46 


311 


2,104 


14,251 


96,516 


653.682 


4.427,241 


3.E+07 


2.E+08 


200 


4 


14 


51 


191 


709 


2,633 


9.786 


36.365 


135.135 


502,173 


300 


3 


7 


20 


54 


146 


396 


1,073 


2,909 


7.882 


21,361 


400 


2 


5 


11 


24 


53 


119 


263 


582 


1,290 


2,859 


500 


2 


4 


7 


14 


27 


51 


99 


190 


367 


707 


600 


2 


3 


5 


9 


16 


28 


48 


84 


147 


255 


700 


2 


3 


4 


7 


11 


18 


28 


46 


74 


119 


800 


2 


2 


4 


5 


8 


12 


19 


28 


43 


66 


900 


1 


2 


3 


4 


6 


9 


14 


20 


29 


41 


1,000 


1 


2 


3 


4 


5 


8 


11 


15 


21 


29 
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For A=4 





n= 


F 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


50 


12 


149 


.1,826 


22.316 


272,759 


3,333,771 


4.E+07 


5.E+08 


6.E+09 


7.E+10 


100 


6 


41 


264 


1.690 


10.834 


69,465 


445.373 


2.855.512 


2.E+07 


1.E+08 


200 


4 


12 


44 


154 


540 


1.902 


6.696 


23,570 


82,967 


292,045 


300 


3 


7 


17 


44 


112 


287 


737 


. 1.894 


4,864 


12,494 


400 


2 


4 


9 


20 


41 


86 


181 


381 


800 


1,681 


500 


2 


3 


6 


11 


20 


37 


68 


125 


229 


418 


600 


2 


3 


5 


7 


12 


20 


34 


56 


92 


152 


700 


2 


2 


4 


5 


8 


13 


20 


30 


46 


71 


800 


1 


2 


3 


4 


6 


9 


13 


19 


27 


39 


900 


1 


2 


3 


4 


5 


7 


9 


13 


18 


25 


1.000 


1 


2 


2 


3 


4 


6 


7 


10 


13 


17 



Extension of the invention to the case of fully comprehensive 
5 'abnormal' cell mutational profiling within an individual 

If we now take 'normal* cells as the source of our biotinylated 
fragments and 'abnormal' cells from the same individual as the source of 
our non-biotinylated fragments, any fragments that have acquired 
sequence changes on the way to becoming 'abnormal' will be unable to 

io form perfectly matched duplexes with the biotinylated 'normal' fragments. 
Successive cycles of such inter-population perfectly matched duplex 
depletion will thus lead to the enrichment of fragments carrying all those 
sequence differences between the 'normal' cells and the 'abnormal' cells - 
the degree of enrichment per cycle being as described below. 

15 From the data of Nickerson et al, a given individual should be 

heterozygous about once every 573 base pairs. 

If we anneal the two complementary strands for a fragment 
using DNA from 'abnormal' cells and DNA from 'normal' cells, then 
{1-e ( " F/573) } of fragments will contain one or more site of heterozygosity. 

20 For a heterozygous site, p and q are both 0.5. The probability 

of obtaining a perfectly matched duplex between a biotinylated fragment 
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and a non-biotinylated fragment containing the site is 0.5. Similarly, the 
probability of obtaining a mismatch-containing duplex between a 
biotinylated fragment and a non-biotinylated fragment containing the site is 
also 0.5. 

5 The fraction of fragments carried through the first cycle of 

inter-population perfectly matched duplex depletion will therefore be 

0.5.{1 - e ( ' F/573) } -{Y/(X+Y)} 

in hence n repetitions of such a process will reduce the original 

number of fragments to the following fraction 



[0.5.{1-e ( " F/573) }] n -{Y/(X+Y)} n 



15 The enrichment for phenotype-determining fragments over 

SNP-containing ('noise 1 ) fragments during inter-population perfectly 
matched duplex depletion cycles will therefore be given by 

[0.5-{1 -e ( - p/573) }]- n 

20 

Example figures for enrichment are given below with F = 50, 
100, 200, 300, 400, 500, 600, 700, 800, 900 and 1,000 and n = 1, 2, 3, 4, 5, 
6, 7, 8, 9 and 10. 
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p 


i 


d. 




A 
*t 


c 
D 


D 


7 


Q 
O 




1 n 




24 


573 


13.71 1 


328.171 


7 RS4 6?7 


2.E+08 


4 5+09 


1.E-r1 1 


3.E+12 


6.E+13 


100 


12 


156 


1.948 


24.329 


303,844 


3.794,731 


5.E+07 


6.E+08 


7.E+09 


9.E+10 


200 


7 


46 


313 


2.123 


14.412 


97,830 


664,076 


4.507.784 


3.E+07 


2.E+08 


300 


5 


24 


118 


580 


2.845 


13.958 


68.490 


336.072 


1,649,059 


8,091,708 


400 


4 


16 


63 


251 


999 


3.977 


15.831 


63.012 


250.815 


998,346 


500 


3 


12 


41 


139 


479 


1.644 


5.650 


19,411 


66.688 


229,114 


600 


3 


9 


29 


90 


278 


856 


2.638 


8.128 


25,047 


77,179 


700 


3 


8 


23 


65 


183 


520 


1.475 


4,183 


11.862 


33,639 


800 


3 


7 


19 


50 


133 


353 


937 


2,491 


6,621 


17,599 


900 


3 


6 


16 


41 


103 


259 


654 


1.652 


4.171 


10,532 


1,000 


2 


6 


14 


34 


84 


202 


490 


1,188 


2,880 


6,978 



This approach will enrich for fragments containing all 
sequence differences within the 'abnormal' cells - no prior knowledge of the 
5 genes (oncogenes, tumour suppressor genes etc. ) that may need to be 
investigated is required. 

Having thus isolated a fragment (or fragments) determining 
differences between the 'affected' and 'unaffected' populations, we can 
now proceed to their analysis. 

10 

Aspect (II) 

In another aspect the invention provides a method of making 
a set of arrays of fragments of DNA of interest, which method comprises: 



a) selecting, from a set of n restriction endonuclease enzymes," a 
15 subset of r restriction endonuclease enzymes; 

b) digesting genomic DNA with the subset of r enzymes; 

c) ligating to the resulting fragments restriction-enzyme-cutting- 
site-specific adapters with unique polymerase chain reaction amplifiable 
sequences; 

20 d) splitting the resulting fragments into r 2 aliquots; 

e) amplifying each aliquot with two-restriction enzyme-specific 
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primers; 

f) forming an array of the r 2 aliquots of the amplimers; 
and 

g) repeating steps a) to f) using a different subset of r restriction 
5 endonuclease enzymes. The invention also includes sets of arrays 

obtained or obtainable by the method. 

The n restriction endonuclease enzymes may be selected 
from 4-cutters and 5-cutters and 6-cutters, and a set may include enzymes 
from one or two or three of these categories. The value of n is preferably 3 
io to 1 0, for reasons discussed below. The value of r is less than n and is 
preferably 2 to 4, chosen with reference to the frequency with which the 
chosen enzymes cut nucleic acids, and ease of fragment amplification by 
PCR. 

15 Terminal restriction site profiling arrays (TRSPAs) 

If we use a total of n 6 bp cutter restriction enzymes within the 
total set of enzymes used for fragmentation, let us use subsets of r 6 bp 
cutter enzymes (taken from the total set n) to make (rxr) TRSPA test 
matrices as follows. 

20 

Version 1 -(TRSPA-1) 

For each (rxr) TRSPA test matrix, digest DNA to completion 
with all r restriction enzymes. Ligate on restriction enzyme cutting-site- 
specific adaptors with unique polymerase chain reaction amplifiable tags. 
25 Split into r 2 aliquots and for each aliquot, amplify with biotinylated restriction 
enzymej and non-biotinylated restriction enzyme k tag-specific primers and 
array the non-biotinylated strands for all values of j and k between 1 and r. 
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Example 

Consider the following dsDNA 

~[A]«1 -[B]-2-[B]-3-[C]--4-[A]-5-[B]- + 
5 -[A]-1 -[B]»2-[B]-3--[C]»4-[A]-5-[B]- - 

where the restriction enzyme cutting sites are denoted A, B 
and C and the fragments after restriction digestion are denoted 1 , 2, 3, 4 
and 5. + and - denote the sense of the strands, 
lo Cut to completion with A, B and C to give 
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A--1--B B--2--B B--3--C C--4--A A--5--B + 
A--1--B B-2--B B--3--C C--4--A A--5-B - 

Polymerase chain reaction amplify according to the following 
5 primer matrix and streptavidin capture to give 





A 


B 


C 


bio-A 




bio-A»1--B + 
A--1--B- 

bio-A--5--B + 
A--5--B - 


C--4--A + 
C~4--A-bio - 


bio-B 


A--1-B + 
A--1~B-bio- 

A--5--B + 
A«5--B-bio - 


bio-B-2-B + 
B--2--B - 

B--2--B + 
B-2-B-bio - 


bio-B--3--C + 
B-3--C - 


bio-C 


bio-C--4~A + 
C--4--A - 


B--3--C + 
B--3--C-bio - 





Keep only the non-biotinylated strands to give 





A 


B 


C 


bio-A 




A--1--B- 
A--5--B - 


C--4--A + 


bio-B 


A--1-B + 
A--5--B + 


B--2--B - 
B--2--B + 


B--3--C - 


bio-C 


C--4--A - 


B--3--C + 
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In simplified form 





A 


B 


C 


bio-A 




1-. 5- 


4+ 


bio-B 


1+. 5+ 


2-, 2+ 


3- 


bio-C 


4- 


3+ 





Repeat for all n C r combinations of restriction enzymes to 
5 generate the TRSPA. 

TRSPA-1 hybridisation patterns 

The two types of TRSPA-1 hybridisation pattern we should 
expect using a probe resulting from inter-population perfectly matched 
io duplex depletion are 

1 . Hybridisation to an off-diagonal element (e.g. row x, column y 
- where x and y are different) and its complementary element reflected 
across the diagonal (i.e. row y, column x) and 

2. Hybridisation to an on-diagonal element (e.g. the element at 
1 5 rowz, column z). 

TRSPA-1 analysis - a worked example for n=3 and r=2 

Let us take the dsDNA 

20 -C- 1 -B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
where 

A, B and C denote restriction enzyme cutting sites. 1,2,3, 4, 
5, 6 t 7, 8 and 9 denote the restriction fragments after digestion. 

25 

The TRSPA-1 test matrices 

There are 3 C 2 =3 TRSPA-1 test matrices - AB, BC and AC. . 
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The test matrix hybridisation patterns 

There are {(2*(2+1))/2}=3 possible hybridisation patterns for 
each of the test matrices 



The combinatorial diversity 

If there are three possible TRSPA-1 test matrix hybridisation 
patterns and three possible test matrices, then there will be 3 3 =27 possible 
TRSPA-1 signatures. 

10 

Fragment 4 analysis - the AB TRSPA-1 matrix 

Take the dsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 



-C-1-B 
B-2-B 
20 B-3-A 
A-4-A 

A-5-C-6-C-7-B 

B-8-A 

A-9-A- 





5 



15 



Cut with A and B to give 



25 



The fragment complementary to fragment 4 will be 
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Polymerase chain reaction amplify and hybridise with just 
fragment 4 to give 



AB matrix A 


B 


bio-A Hn 


bio-B 





? Fragment 4 analysis - the BC TRSPA-1 matrix 

Take the dsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 

io Cut with B and C to give 

-C-1-B 
B-2-B 

B-3-A-4-A-5-C 
15 C-6-C 
C-7-B 
B-8-A-9-A- 

The fragment complementary to fragment 4 will be 

20 

B-3-A-4-A-5-C 

Polymerase chain reaction amplify and hybridise with just 
fragment 4 to give 
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BC matrix 


B 


C 


bio-B 






bio-C 







Fragment 4 analysis - the AC TRSPA-1 matrix 

Take the dsDNA 

5 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
Cut with A and C to give 

10 -C 

C-1-B-2-B-3-A 

A-4-A 

A-5-C 

C-6-C 
15 C-7-B-8-A 

A-9-A 

A- 

The fragment complementary to fragment 4 will be 

20 

A-4-A 

Polymerase chain reaction amplify and hybridise with just 
fragment 4 to give 



AC matrix 


A 


C 


bio-A 






bio-C 







25 
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Fragment 5 analysis - the AB TRSPA-1 matrix 

Take the dsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 

5 

Cut with A and B to give 

-C-1-B 
B-2-B 
l(.) B-3-A 
A-4-A 

A-5-C-6-C-7-B 

B-8-A 

A-9-A- 

15 

The fragment complementary to fragment 5 will be 
A-5-C-6-C-7-B 

20 Polymerase chain reaction amplify and hybridise with just 

fragment 5 to give 



"a [~b I 

:39 



AB matrix 



bio-A 



bio-B 



Fragment 5 analysis - the BC TRSPA-1 matrix 

25 Take the dsDNA 



-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
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Cut with B and C to give 

-C-1-B 
B-2-B 
5 B-3-A-4-A-5-C 
C-6-C 
C-7-B 
B-8-A-9-A- 

io The fragment complementary to fragment 5 will be 

B-3-A-4-A-5-C 

Polymerase chain reaction amplify and hybridise with just 
15 fragment 5 to give 



BC matrix 

bio-B 

bio-C 



Fragment 5 analysis - the AC TRSPA-1 matrix 

Take the dsDNA 

20 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
Cut with A and C to give 

25 -C 

C-1-B-2-B-3-A 

A-4-A 

A-5-C 
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C-6-C 
C-7-B-8-A 
A-9-A 
A- 

5 

The fragment complementary to fragment 5 will be 

A-5-C 

io Polymerase chain reaction amplify and hybridise with just 

fragment 5 to give 



AC matrix 

bio-A 

bio-C 



Fragment 6 analysis - the AB TRSPA-1 matrix 

15 Take the dsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
Cut with A and B to give 

20 

-C-1-B 
B-2-B 
B-3-A 
A-4-A 
25 A-5-C-6-C-7-B 
B-8-A 
A-9-A- 
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10 



15 



20 



The fragment complementary to fragment 6 will be 



A-5-C-6-C-7-B 

Polymerase chain reaction amplify and hybridise with just 
fragment 6 to give 



Fragment 6 analysis - the BC TRSPA-1 matrix 

Take the dsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 



Cut with B and C to give 



-C-1-B 
B-2-B 

B-3-A-4-A-5-C 

C-6-C 

C-7-B 



The fragment complementary to fragment 6 will be 



Polymerase chain reaction amplify and hybridise with just 
fragment 6 to give 




C-6-C 
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BC matrix 


B 


C 


bio-B 






bio-C 







Fragment 6 analysis - the AC TRSPA-1 matrix 

Take the dsDNA 

5 

-C- 1 -B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 

Cut with A and C to give 

io -C 

C-1-B-2-B-3-A 

A-4-A 

A-5-C 

C-6-C 
15 C-7-B-8-A 

A-9-A 

A- 

The fragment complementary to fragment 6 will be 

20 

C-6-C 

Polymerase chain reaction amplify and hybridise with just 
fragment 6 to give 



AC matrix 


A 


c 


bio-A 






bio-C 




■ 
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Overall results 



Fragment 4 TRSPA-1 analysis 



AB 



BC 



AC 







Hi 


■ 




Fragment 5 TRSPA-1 analysis 



AB 



BC 



AC 



Fragment 6 TRSPA-1 analysis 



AB 



BC 



AC 



□a 



Version 2 - (TRSPA-2) 

For each (rxr) TRSPA test matrix, digest DNA to completion 
with all r restriction enzymes. Ligate on restriction enzyme cutting-site- 
specific adaptors with unique polymerase chain reaction amplifiable tags. 
Split into r 2 aliquots and for each aliquot, amplify with non-biotinylated 
restriction enzymej and non-biotinylated restriction enzyme k tag-specific 
primers and array the denatured strands for all values of j and k between 
1 and r. 



20 
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Example 

Consider the following dsDNA 

„ [A ]« 1 -[B]-2--[B]--3»[C]--4.-[A]--5-[B]« + 
5 .- [ A].-1-[B]»2«[B]»3»[C]»4»[A]--5»[B]-- - 

where the restriction enzyme cutting sites are denoted A, B 
and C and the fragments after restriction digestion are denoted 1, 2, 3, 4 
and 5. + and - denote the sense of the strands. 
io Cut to completion with A, B and C to give 

A--1-B B--2--B B--3--C C-4--A A--5--B + 
A--1--B B--2--B B--3--C C--4--A A--5--B - 



15 Polymerase chain reaction amplify according to the following 

primer matrix to give 
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A 


B 


C 


A 




A--1-B + 


C--4--A + 






A--1--B - 


C--4--A • 






B-2--B + 








B--2--B - 








A--5--B + 








A--5--B - 




B 


A--1--B + 


B-2-B + 


B-2--B + 




A— 1--B - 


B--2-B - 


B--2--B - 




B--2--B + 




B--3--C + 




B--2-B - 




B-3--C - 




A--5--B + 








A--5--B - 






C 


C--4--A + 


B--2-B + 






C--4--A - 


B--2--B - 








B--3--C + 








B--3--C - 





in simplified form 





A 


B 


C 


A 




1,2,5 


4 


B 


1.2,5 


2 . 


2, 3 


C 


4 


2,3 





5 

Repeat for all n C r combinations of restriction enzymes to 
generate the TRSPA. 
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TSPSA-2 hybridisation patterns 

The two types of TRSPA-2 hybridisation pattern we should 
expect using a probe resulting from inter-population perfectly matched 
duplex depletion are 
5 i . Hybridisation to an off-diagonal element (e.g. row x, column y 

- where x and y are different) and its complementary element reflected 
across the diagonal (i.e. row y, column x) and 

2. Hybridisation to a whole row and column intersecting at an 

on-diagonal element (e.g. all of row z and all of column z). 

10 

TRSPA-2 analysis - a worked example for n=3 and r=2 

Let us take the dsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 

15 

where 

A, B and C denote restriction enzyme cutting sites. 1,2,3, 4, 
5, 6, 7, 8 and 9 denote the restriction fragments after digestion. 



20 The TRSPA-2 test matrices 

There are 3 C 2 =3 TRSPA-2 test matrices - AB, BC and AC. 

The test matrix hybridisation patterns 

There are {(2-(2+1))/2}=3 possible hybridisation patterns for 
25 each of the test matrices 



b a 




The combinatorial diversity 

If there are three possible TRSPA-2 test matrix hybridisation 
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patterns and three possible test matrices, then there will be 3 3 =27 possible 
TRSPA-2 signatures. 

Fragment 4 analysis - the AB TRSPA-2 matrix 

5 Take the dsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
Cut with A and B to give 

io 

-C-1-B 
B-2-B 
B-3-A 
A-4-A 
15 A-5-C-6-C-7-B 
B-8-A 
A-9-A- 

The fragment complementary to fragment 4 will be 

20 

A-4-A 

Polymerase chain reaction amplify and hybridise with just 
fragment 4 to give 

25 




Fragment 4 analysis - the BC TRSPA-2 matrix 

Take the dsDNA 
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-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
Cut with B and C to give 

-C-1-B 

s B-2-B 

B-3-A-4-A-5-C 
C-6-C 
C-7-B 
B-8-A-9-A- 

10 

The fragment complementary to fragment 4 will be 
B-3-A-4-A-5-C 

15 Polymerase chain reaction amplify and hybridise with just 

fragment 4 to give 



BC matrix 

B 

C 



Fragment 4 analysis - the AC TRSPA-2 matrix 

20 Take the dsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
Cut with A and C to give 

25 

-c 

C-1-B-2-B-3-A 
A-4-A 
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10 



A-5-C 

C-6-C 
C-7-B-8-A 
A-9-A 
A- 



The fragment complementary to fragment 4 will be 



A-4-A 



Polymerase chain reaction amplify and hybridise with just 
fragment 4 to give 



AC matrix 



■n 



15 Fragment 5 analysis - the AB TRSPA-2 matrix 

Take the dsDNA 

-C- 1 -B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 



20 



Cut with A and B to give 



-C-1-B 
B-2-B 
B-3-A 
25 A-4-A 

A-5-C-6-C-7-B 

B-8-A 

A-9-A- 
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The fragment complementary to fragment 5 will be 



A-5-C-6-C-7-B 

Polymerase chain reaction amplify and hybridise with just 
fragment 5 to give 



Fragment 5 analysis - the BC TRSPA-2 matrix 



-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 



Cut with B and C to give 



-C-1-B 
B-2-B 

B-3-A-4-A-5-C 
C-6-C 
20 C-7-B 

B-8-A-9-A- 



The fragment complementary to fragment 5 will be 



25 B-3-A-4-A-5-C 

Polymerase chain reaction amplify and hybridise with just 
fragment 5 to give 




10 



Take the dsDNA 



15 
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C 



BC matrix 



B 




Fragment 5 analysis - the AC TRSPA-2 matrix 



Take the dsDNA 



-C- 1 -B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 



Cut with A and C to give 



10 -C 

C-1-B-2-B-3-A 

A-4-A 

A-5-C 

C-6-C 
15 C-7-B-8-A 

A-9-A 

A- 



The fragment complementary to fragment 5 will be 



Polymerase chain reaction amplify and hybridise with just 



fragment 5 to give 



20 



A-5-C 
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10 




15 



20 



Fragment 6 analysis - the AB TRSPA-2 matrix 

Take the dsDNA 

-C- 1 -B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 



Cut with A and B to give 



-C-1-B 
B-2-B 
B-3-A 
A-4-A 

A-5-C-6-C-7-B 

B-8-A 

A-9-A- 



The fragment complementary to fragment 6 will be 



A-5-C-6-C-7-B 

Polymerase chain reaction amplify and hybridise with just 
fragment 6 to give 



Fragment 6 analysis - the BC TRSPA-2 matrix 

Take the dsDNA 

-C- 1 -B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
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Cut with B and C to give 

-C-1-B 
B-2-B 
5 B-3-A-4-A-5-C 
C-6-C 
C-7-B 
B-8-A-9-A- 

io The fragment complementary to fragment 6 will be 

C-6-C 

Polymerase chain reaction amplify and hybridise with just 
15 fragment 6 to give 



BC matrix 

B 

C 



Fragment 6 analysis - the AC TRSPA-2 matrix 

Take the dsDNA 

20 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
Cut with A and C to give 

25 -C 

C-1-B-2-B-3-A 
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C-6-C 
C-7-B-8-A 
A-9-A 
A- 



The fragment complementary to fragment 6 will be 



C-6-C 



io Polymerase chain reaction amplify and hybridise with just 

fragment 6 to give 



AC matrix 



15 



Overall results 



Fragment 4 TRSPA-2 analysis 



AB 



BC 



AC 






Fragment 5 TRSPA-2 analysis 



20 



AB 



BC 



AC 



H3 bfl 
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Fragment 6 TRSPA-2 analysis 

AB BC AC 



The number of test matrices 

5 For a total of n enzymes used for fragmentation and a panel 

of r enzymes per (rxr) test matrix, there will be n C r possible (rxr) test 
matrices, where 

n C r =(n!)/[(n-r)!.r!] 

o 

n C r for various n and r is given in the following table 





n= 


r 


3 


4 


5 


6 


7 


8 


9 


10 . 


1 


3 


4 


5 


6 


7 


8 


9 


10 


2 


3 


6 


10 


15 


21 


28 


36 


45 


3 


1 


4 


10 


20 


35 


56 


84 


120 


4 




1 


5 


15 


35 


70 


126 


210 


5 






1 


6 


21 


56 


126 


252 


6 








1 


7 


28 


84 


210 


7 










1 


8 


36 


120 


8 












1 


9 


45 


9 














1 


10 


10 
















1 



The number of TRSPA spots arrayed 

5 The total number of TRSPA spots is given by 




r 2 .{ n C r } = r 2 -{(n!)/[(n.r)!.r!]} 
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^-{"Cr} for various n and r is given in the following table 





n = 


H 


3 


4 


5 


6 


7 


8 


9 


10 


1 


3 


4 


5 


6 


7 


8 


9 


10 


2 


12 


24 


40 


60 


84 


112 


144 


180 


3 


9 


36 


90 


180 


315 


504 


756 


1,080 


4 




16 


80 


240 


560 


1,120 


2,016 


3.360 


5 






25 


150 


525 


1,400 


3,150 


6,300 


6 








36 


252 


1,008 


3.024 


7.560 


7 










49 


392 


1,764 


5.880 


8 












64 


576 


2.880 


9 














81 


810 


10 
















100 



Each test as above will give rise to a particular hybridisation 

5 'signature'. 

TRSPA-1 TRSPA-2 
(for n=6 and r=3) (for n=6 and r=3) 

i I m III! nn I I I I MB I M I I M I MB 

i i i i I ■ I ■TTi MM MM W m W ■ i i i f ■ 

Mai cqj qju tni id an nil — 

i m i 1 M 1 Mil m I M I II I I I I II OH 

■ M i i ■ FTjI m I i mil 1 1 ■ Qj| KX] 

□ZjDOCjODZj uJj lMj lMj QIj 

HHEBEB1BB BBHHJHB 

IMJ MM | M I pcD 14-U m 

Em hhfl HH tttd HB fcfcfl Kd 

i mi i m i ran iim i m i rrw m 1 1 ii 

■ » 1 1 1 1 1 m m m m m Qj| 
|0 QXI lljI QIj QC leu Hi CCD ljC 

There will be {1+2+...+r} = {r(r+1)/2} patterns per (rxr) test 
matrix. If there are n C r (rxr) test matrices, the total possible number of 
signatures will be given by {r(r+1 )/2} raised to the power n C r . We require 
i5 that the number of such different signatures is greatly in excess over the 
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number of fragments generated upon fragmentation. 

If we choose n=6 and r=3, there will be 6 possible 
hybridisation patterns per test and 20 possible tests - giving a combinatorial 
diversity of 6 20 = 3.7-1 0 15 . The preferred scheme therefore employs six 
5 enzymes, tested in groups of three - giving 180 spots per experiment. In 
order to avoid restriction fragment length polymorphism problems, a 
duplicate analysis could be performed with a different set of enzymes 
sharing no cutting sites in common with the first set. 

io Example restriction enzyme sets for the preparation of test matrices 
for TRSPAs 



Criteria for enzyme choice 

In order to make TRSPAs, the selection of suitable enzymes 
15 is an important factor. Ideally, two sets of different enzymes are required to 
eliminate the small possibility that a phenotype-determining polymorphism 
might fall within a chosen restriction site and therefore compromise the 
specificity of the resulting signature. The selection of enzymes can be 
based upon a number of criteria 
20 a) The enzymes should be 6 bp cutters. 

b) Cleavage by any selected enzyme should leave a 4 bp 
overhang at the 5' end. 

c) The selected enzymes in each set should all work efficiently 
under the same buffer conditions. 

25 d) The selected enzymes in each set should ideally work 

efficiently at a single incubation temperature. 

e) The chosen enzymes should be commercially available - 
ideally at concentrations of 10 U / |il or more. 

f) The 5' overhangs left by any two enzymes in the same set 
30 should not be identical. 

g) No enzyme should appear in both sets for TRSPA fabrication. 



WO 00/55364 



PCT/GB00/00916 



-54- 

h) Enzymes should be selected to avoid or minimise the effects 

of mammalian methylation patterns. In particular, enzymes with CG 
dinucleotides in their recognition sites should be avoided unless the 
enzyme is known to be able to restrict m5 CpG sites. 

•> 

DNA methylation 

In vertebrates, DNA is often methylated at the 5 th position of 
cytosine in the sequence of CpG and this is the only chemical modification 
that DNA of vertebrates contains under physiological conditions. By the 

]o careful selection of enzymes which do not contain CpG sequences within 
the recognition site, or the selection of enzymes which freely restrict m5 CpG 
methylated sites, it is possible to remove the potentially adverse effects of 
DNA methylation from the TRSPA analysis. 6 bp cutters which are known 
to restrict m5 CpG modified DNA efficiently to leave a 5' four base overhang 

15 are BspEI and Xmal. These enzymes are therefore the only enzymes with 
restriction sites containing CpG dinucleotides that are potentially useful in a 
TRSPA analysis. 

Enzyme selection method 

20 Sixteen possible four base pair overhangs exist (excluding 

unusual enzymes with asymmetrical recognition sequences such as BssSI 
or Ss/7), five of which contain CG in the sequence. A further four 
overhangs could potentially contain CG sequences within the restriction 
recognition site if preceded by a C and followed by a G. Enzymes are 

25 therefore preferentially selected from the remaining six groups. 

Excluding isoschizomers, there are up to four possible 
enzymes which would leave a particular 5' four bp overhang. For example, 
enzymes leaving a CTAG overhang are 
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Avrll - CCTAGG 

Nhel - GCTAGC 

Spel - ACTAGT 

Xbal - TCTAGA 

Enzymes to cleave sites with all the combinations of flanking 
bases are not available for all overhangs - hence the enzyme choice is 
more limited for some overhang groups than others. 

As a primary step towards enzyme selection, the enzymes 

io supplied by Amersham Pharmacia Biotech, New England Biolabs and 
Promega are ordered below by overhang sequence. Supplementary 
details such as the percentage activity in common buffers, the reaction 
temperature, concentration and supplier are also recorded. For three 
overhang sequences, there are no available enzymes, the remaining 13 

15 are described below. Enzymes considered unsuitable due to methylation 
sensitivity are shaded (darker shade). Enzymes considered unfavourable 
due to the presence of CpG sites even though they do restrict methylated 
DNA to some degree are also shaded (lighter shaded). 

20 Candidate restriction enzymes leaving a 5' overhang of GTAC 



Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m °C 


U/(il 


Supplier 


Inactivate 
°C/min 


Acc65l 


GGTACC 


10/50/100 


37 


10 


NEB 


65/20 


BsiWI 


CGTACG 


100(50)/10q(50)/25(-) 


; 55(37^: 


^•12;..:^ 






BsrGI 


TGTACA 


10/100/100 


37 


10 


NEB 


80/20 


Spit 


CGTACG, 

■> : : ; ;i'v> * /-J. . ■ ' ■ .Vi-i 

: ; i ^ ■ • '■. 


, 10CV2Q«^ ^ 
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Candidate restriction enzymes leaving a 5' overhang of TTAA 



Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m : C 


U/ul 


Supplier 


Inactivate 
c C/min 


Aflll 


CTTAAG 


25/100/100 


37 


10 


APB 


60/15 


Candidate restriction enzymes leaving a 5' overhang of AATT 


Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m C C 


U/ul 


Supplier 


Inactivate 
°C/min 


EcoRI 


GAATTC 


100/100/100 


37 


>40 


APB/NEB 


65/20 


Mfel 


CAATTG 


10/50/100 


37 


10 


NEB 


65/20 


Candidate restriction enzymes leaving a 5' overhang of CATG 


Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m °C 


U/pl 


Supplier 


Inactivate 
°C/min 


BspHI 


TCATGA 


50/100/100 


37 


10 


NEB/APB 


65/20 


Ncoi 


CCATGG 


100/100/100 


37 


50 


NEB/APB 


65/20 


Candidate restriction enzymes leaving a 5' overhang of GATC 


Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m °C 


U / Ml 


Supplier 


Inactivate 
°C/min 


BamHl 


GGATCC 


50/100/75 


37 


>40 


APB/NEB 


80/20 


Bell 


TGATCA 


100/100/75 


50 (37) 


10 


NEB 


No 


Bglll 


AGATCT 


100/75/10 


37 


>40 


APB/NEB 


No 
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Candidate restriction enzymes leaving a 5' overhang of CCGG 



Enzyme 


Site 


H/M/T buffer + BSA 
°b activity 


Optimu 
m C C 


U/ul 


Supplier 


Inactivate 
; C/min 


BspEI 


TCCGGA 


100/10/0 


37 


10 


NEB 


80/20 


NgoMIV 


GCCGGC 


10/50/100 


37 


10 


NEB > 


■ ; W#2$$^r 


Xmal 


CCCGGG 


0/100/100 


37 


10 


NEB 


65/20 


Candidate restriction enzymes leaving a 5' overhang of GCGC 


Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m C C 


U/ul 


Supplier 


Inactivate 
c C/min 


Kasl 


GGCGCC 


75/100/75, 


37 


5,. 


NEB ; ; 





Candidate restriction enzymes leaving a 5* overhang of TCGA 



Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m c C 




Supplier 


Inactivate 
°C/min 


PaeR7l 


CTCGAG 


10/100/100:, 


37 


.2o:>- ; , 






Sail 


GTCGAC 


50/0/0% . 


37 


>40- 


APB/NEB 




Xhol 


CTCGAG 


100/100/100 


37 


>40 


APB/NEB 





io Candidate restriction enzymes leaving a 5' overhang of AGCT 



Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m °C ■ 


U/uJ 


Supplier 


Inactivate 
X/min 


Hindill 


AAGCTT 


10/100/50 


37 


>40 


NEB/APB 


65/20 
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Candidate restriction enzymes leaving a 5' overhang of CGCG 



Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m Z C 


U / ul 


Supplier 


Inactivate 
c C/min 


BssHII 


GCGCGC 


100/100/100 


50 


20 


NEB/APB 




Mlul 


ACGCGT 


100/75/50 


37 


10 


APB/NEB 




Candidate restriction enzymes leaving a 5' overhang of GGCC 


Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m ; C 


U/uJ 


Supplier 


Inactivate 
°C/min 


Eagl 


CGGCCG 


100/25/10 


37 


50 : 


; _. NEB;:;.; 




Eco52l 
(Eagl) 


CGGCCG 


20/20/20, 




1£L : 






Candidate restriction enzymes leaving a 5 f overhang of TGCA 


Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m C C 


U/|il 


Supplier 


Inactivate 
°C/min 


ApaLI 


GTGCAC 


10/100/100 


37 


10 


NEB/APB 


70/15 


Alw44l 


GTGCAC 


-/50A 


37 


10 


Promega 


70/15 


Candidate restriction enzymes leaving a 5' overhang of CTAG 


Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m°C 


U/ul 


Supplier 


Inactivate 
°C/min 


Avrll 


CCTAGG 


50/100/100 


37 


4 


NEB 


No 


Nhel 


GCTAGC 


10/100/100 


37 


10 


APB/NEB 


65/20 


Spel 


ACTAGT 


25/100/75 


37 


50 


NEB 


65/20 


Xbal 


TCTAGA 


75/100/75 


37 


100 


NEB 


65/20 


Blnl(Avril) 


CCTAGG 


40/20/20 


37 


10 


APB 


No 
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From the above tables, a short-list of the most useful 
enzymes is given below for each of the buffer conditions shown 



Overhang 


Buffer H + BSA 


Buffer M + BSA 


Buffer T + BSA 




Enzyme 


Efficiency 


Enzyme 


Efficiency 


Enzyme 


Efficiency 


AATT 


EcoRI 


100 


EcoRI 
Mfel 


100 
50 


Mfel 

- 


100 

- 


CATG 


Ncol 


100 


Ncol 


100 


Ncol 


100 




BspHI 


50 


BspHI 


100 


BspHI 


100 


GATC 


Bglll 


100 


BamHI 


100 


BamHI 


75 




BamHI 


50 


Bglll 


75 






TATA 


- 


- 


- 


- 


- 


- 


ACGT 


- 


- 


- 


- 


- 


- 


CCGG 


BspEI 


100 


Xmal 


100 


Xmal 


100 


GCGC 


- 


- 


- 


- 


- 


- 


TCGA 














AGCT 


- 


- 


Hindlll 


100 


Hindlll 


50 


CGCG 


- 


- 


- 


- 


- 


- 


GGCC 














TGCA 






ApaLI 


100 


ApaLI 


100 


ATAT 














CTAG 


Xbal 


75 


Xbal 


100 


Nhel 


100 




Avrll 


50 


Spel 


100 


Avrll 


100 




Blnl 


40 


Nhel 


100 


Xbal 


75 








Avrll 


100 


Spel 


75 


GTAC 


Acc65l 


100 


BsrGI 
Acc65l 


100 
75 


BsrGI 


100 


TTAA 






Alill 


100 


Aflll 


100 



Taking into account all of the criteria explained above, an 
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example selection of two six-enzyme sets is described below. Reserve 
enzymes, which could also be used, are shown as well. These reserve 
enzymes can be substituted (provided this does not cause overhang 
duplication) if practical problems regarding enzyme availability or 
performance should occur. 



Example enzyme set 1 - for restriction in buffer M + BSA 



Number 


Enzyme 


Site 


M+BSA 

% 
activity 


Optimum 

C C 


U 


Supplier 


Inactivate 
°C/min 


1 


BamHI 


GGATCC 


100 


37 


100 


NEB 


80/20 


2 


BsrGI 


TGTACA 


100 


37 


10 


NEB 


80/20 


3 


Hindlli 


AAGCTT 


100 


37 


>40 


APB 


65/20 


4 


Ncol 


CCATGG 


100 


37 


50 


NEB 


65/20 


5 


Spel 


ACTAGT 


100 


37 


50 


NEB 


65/20 


6 


Aflll 


CTTAAG 


100 


37 


10 

- 


APB 


60/15 



io Example enzyme set 2 - for restriction in buffer M + BSA 



Number 


Enzyme 


Site 


M+BSA 

% 
activity 


Optimum 
C C 


U/ul 


Supplier 


Inactivate 
°C/min 


1 


EcoRI 


GAATTC 


100 


37 


>40 


APB 


65/20 


2 


BspHI 


TCATGA 


100 


37 


10 


APB 


65/20 


3 


Bgllt 


AGATCT 


75 


37 


>40 


APB 


No 


4 


Xbal 


TCTAGA 


100 


37 


100 


NEB 


65/20 


5 


Acc65l 


GGTACC 


75 


37 


10 


NEB 


65/20 


6 


ApaU 


GTGCAC 


100 


37 


10 


NEB 


No 
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1. Example reserve enzymes 



Number 


Enzyme 


Site 


M+BSA 

% 
activity 


Optimum 

C C 


U/ul 


Supplier 


Inactivate 
c C/min 


r1 


Xmal 


CCCGGG 


100 


37 


10 


NEB 


65/20 


r2 


BspEl 


TCCGGA 


100 


37 


10 


NEB 


80/20 


r3 


Nhel 


GCTAGC 


100 


37 


10 


APB 


65/20 


r4 


Avrll 


CCTAGG 


100 


37 


4 


NEB 


No 



Example specific (3x3) terminal restriction site profiling array (TRSPA) 
test matrices for above two sets of six enzymes 

Triplet combinations from BamHI, BsrGI, Hindlll, Ncol, Spel and Aflll 

For example set 1 - BamHI, BsrGI, Hindlll, Ncol, Spel and 
Aflll - the 20 (= 6 C 3 ) triplet combinations are 



BamHI BsrGI Hindfll 
BamHI BsrGI Ncol 
BamHI BsrGI Spel 
BamHI BsrGI Aflll 
BamHI Hindlll Ncol . 



BamHI Hindlll Spel 
BamHI Hindlll Aflll 
BamHI Ncol Spel 
BamHI Ncol Aflll 
BamHI Spel Aflll 



BsrGI Hindlll Ncol 
BsrGI Hindlll Spel 
BsrGI Hindlll Aflll 
BsrGI Ncol Spel 
BsrGI Ncol Aflll 



BsrGI Spel Aflll 
Hindlll Ncol Spel 
Hindlll Ncol Aflll 
Hindlll Spel Aflll 
Ncol Spel Aflll 



Triplet combinations from EcoRI, BspHI, Bglll, Xbal, Acc65l and ApaLI 

For example set 2 - EcoRI, BspHI, Bglll, Xbal, Acc65l and 
ApaLI - the 20 (= 6 C 3 ) triplet combinations are 



EcoRI BspHI Bglll 
EcoRI BspHI Xbal 
EcoRI BspHI AccSSl 
EcoRI BspHI ApaU 
EcoRI Bglll Xbal 



EcoRI Bglll Acc65l 
EcoRI Bglll ApaU 
EcoRI Xbal Acc65l 
EcoRI Xbal ApaLI 
EcoRI Acc65l ApaLI 



BspHI Bglll Xbal 
BspHI Bglll Acc65l 
BspHI Bglll ApaU 
BspHI Xbal Acc65l 
BspHI Xbal ApaLI 



BspHI Acc65l ApaU 
Bglll Xbal Acc65l 
Bglll Xbal ApaU 
Bglll Acc65l ApaU 
Xbal Acc65l ApaLI 
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Example TRSPA-1 test matrices for set 1 - BamHI, BsrGI, Hindlll, Ncol, 
Spel and AWI 



matrix 1 


5'-HO-BamHAprimer 


5'-HO-SsrG/-primer 


5'-HO-H/nd///-primer 


S'-biotin-SamHAprimer 


1 


2 


3 


S'-biotirvBsrGAprimer 


4 


5 


6 


5'-biotin-H//?c///Aprimer 


7 


8 


9 



matrix 2 


S'-HO-BamHAprimer 


5-HO-BsrGAprimer 


S-HO-Nco /-primer 


S'-biotin-BamHAprimer 


1 


2 


3 


S'-biotin-BsrGAprimer 


4 


5 


6 


5-biotin-Wco/-prrmer 


7 


8 


9 



3 



matrix 3 


5 ! -HO-BamHAprimer 


5'-HO-BsrGAprimer 


S'-HO-SpeAprimer 


S'-biotin-SamHAprimer 


1 


2 


3 


S'-biotin-BsrGAprimer 


4 


5 


6 


5'-biotin-SpeAprimer 


7 


8 


9 



matrix 4 


S'-HO-BamHAprimer 


S'-HO-BsrGAprimer 


S'-HO-A ////-primer 


S'-biotin-BamHAprimer 


1 


2 


3 


5'-biotin-0srG /-primer 


4 


5 


6 


5'-biotirMff/Aprrmer 


7 


8 


9 



matrix 5 


5*-HO-SamHAprimer 


5'-HO-/-//nd///-primer 


5*-HO-/Vco/-primer 


5'-biotin-BamHApnmer 


1 


2 


3 


S'-biotin-H/ncWAprimer 


4 


5 


6 


5'-biotin-/VcoAp rimer 


7 


8 


9 



matrix 6 


5*-HO-BamHAprimer 


S'-HO-H/'nctf/Aprimer 


5*-HO-SpeAprimer 


S'-biotin-Ba/nHAp rimer 


1 


2 


3 


5'-biotin-H/nd///-p rimer 


4 


5 


6 


5'-biotin-SpeAprimer 


7 


8 


9 



matrix 7 


S'-HO-BamHAprimer 


5'-HO-H/>7cy///-primer 


S'-HO-zArV/Aprimer 


5'-biotin-BamHAprimer 


1 


2 


3 


5'-biotin-H/nd///-pnmer 


4 


5 


6 


S'-biotin-^tf/Aprimer 


7 


6 


9 
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matrix 8 


S'-HO-BamHAprimer 


5-HO-A/co/-primer 


5'-HO-Spe/-primer 


5'-biotin-SamH/-primer 


1 


2 


3 


5'-biotin-A/co/-primer 


4 


5 


6 


5'-biotin-Spe/-primer 


7 


8 


9 



matrix 9 


5'-HO-Sa/7?H/-primer 


5'-HO-A/co/-primer 


5-HO->W//-primer 


5 ! -biotin-SamH/-primer 


1 


2 


3 


5'-biotin-A/co/-prrrner 


4 


5 


6 


5'-biotin-/AW/-primer 


7 


8 


9 



matrix 10 


5'-HO- BamHAprimer 


5'- HO- Spe /-primer 


5'-HCM////-primer 


S'-biotin-SamHZ-pnmer 


1 


2 


3 


5'-biotin-Spe/-primer 


4 


5 


6 


5-btotin-/A////-primer 


_ 7 


8 


9 



matrix 1 1 


5-HO-Ss/-G/-primer 


5-HO-W/nc///Aprimer 


5-HO-/VcoAprimer 


S'-biotin-BsrGAprimer 


1 


2 


3 


5'-biotin-H/nd//Aprimer 


4 


5 


6 


5'-biotin-A/co/-prirner 


7 


8 


9 



matrix 1 2 


5'-HO-BsrGAprimer 


S'-HO-Hmctf/Aprimer 


5'- HO- Spe/- primer 


S'-biotin-SsrGAprimer 


1 


2 


3 


5'-biotin-H/nd//Apnrner 


4 


5 


6 


5-biotin-SpeAprimer 


7 


8 


9 



5 



matrix 13 


5-HO-Bs/-G/-primer 


S'-HO-H/ncW/Aprimer 


5'-HO->tf//Aprimer 


5'-biotin-Bs/"GAprimer 


1 


2 


3 


5'-btotin-H/nd///-primer 


4 


5 


6 


5'-biotin-/\///Aprimer 


7 


8 


9 



matrix 14 


S'-HO-BsrGAprimer 


5'-HO-A/co/-primer 


5'-HO-SpeAprimer 


S'-biotin-flsrGAprimer 


1 


2 


3 


5'-biotin-/VcoAprimer 


4 


5 


6 


S'-biotin-SpeAprimer. 


7 


8 


9 



matrix 15 


S'-HO-SsrGAprimer 


5'-HO-A/coAprimer 


S'-HO-ZW/Aprimer 


5'-biotin-BsrGAprimer 


1 


2 


3 


S'-biotin-A/coAprimer 


4 


5 


6 


5'-biotin->W//Aprimer 


7 


8 


9 
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matrix 16 


5'-HO-BsrG/-primer 


5-HO-Spe/-primer 


5'-HO-4////-primer 


5-biotin-BsrG /-primer 


1 


2 


3 


5-biotin-Spe/-primer 


4 


5 


6 


5'*biotin-/4////-primer 


7 


8 


9 



matrix 17 


5*-HO-H/>7d///-primer 


5"-HO-A/co/-primer 


5-HOSpe/-primer 


5'-biotin-/-//nd///-p rimer 


1 


2 


3 


5'-biotin-A/co/-primer 


4 


5 


6 


S'-biotin-SpeAprimer 


7 


8 


9 



matrix 18 


5 l -HO-H/nd///-primer 


5'-HO-/Vco/-primer 


5'-HO->4/7//-primer 


5'-biotin-H/nd///-primer 


1 


2 


3 


5'-biotin-/Vco/-primer 


4 


5 


6 


S'-biotin-zA////- primer 


7 


8 


9 



matrix 19 


5'-HO-H/nd///-primer 


5'-HO-Spe/-primer 


S'-HCMrV/Aprimer 


5*-biotin-H/nc////*primer 


1 


2 


3 


5'-biotin-Spe/-primer 


4 


5 


6 


5'-biotin-/tf///-primer 


7 


8 


9 



matrix 20 


S'-HO-A/coAprimer 


S'-HO-SpeAprimer 


5'-HO->W//-primer 


5'-biotin-/Vco/-primer 


1 


2 


3 


5'-biotin-Spe/-primer 


4 


5 


6 


5'-biotin-^ff//-primer 


7 


8 


9 



5 

Example TRSPA-1 test matrices for set 2 - EcoRI, BspHt, Bglll, Xbal, 
Acc65l and ApaLI 



matrix 21 


S'-HO-EcoflAprimer 


S'-HO-BspHAprimer 


5'-HO-B0//Aprimer 


5'-biotin- EcoflAprimer 


1 


2 


3 


S'-biotin-BspHAprtmer 


4 


5 


6 


5'-biotin-Bg//Aprirner 


7 


8 


9 



matrix 22 


5-HO-EcoflAprimer 


S'-HO-BspHAprimer 


5'- HO- Xba /-primer 


5'-biotin-£coflAprimer 


1 


2 


3 


5'-biotin-BspHAprimer 


4 


5 


6 


S'-biotin-XfcaAprimer 


7 


8 


9 
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matrix 23 


5-HO-Ecofl/-primer 


S'-HO-BspHZ-primer 


5'-HCMcc65/-primer 


5-biotin-Ecofi/-primer 


1 


2 


3 


5 ! -biotin-BspH/-primer 


4 


5 


6 


5'-biotin-y4cc65/-primer 


7 


8 


9 



matrix 24 


5*-HO-Ecofl/-primer 


5'-HO-flspH/-primer 


5 ! -HO-4paL/-primer 


5'-biotin-£cofl/-primer 


1 


2 


3 


5'-biotin-BspH/-primer 


4 


5 


6 


5'-biotin-ApaL/-primer 


7 


8 


9 



matrix 25 


5*-HO-Ecotf /-primer 


5 , -HO-Sg///-primer 


5'-HO-X£?a/-primer 


5-biotin-Ecofl/-primer 


1 


2 


3 


5 ; -biotin-Bg///-primer 


4 


5 


6 


5'-biotin-Xba/-primer 


7 


8 


9 



matrix 26 


5*-HO-Ecofl/-primer 


5'-HO-Bp///-primer 


5'-HO-/\cc65/-primer 


5*-biotin-Ecofl/-primer 


1 


2 


3 


5*-biotin-Bg///-primer 


4 


5 


6 


5*-biotin-^cc65/-primer 


7 


8 


9 



matrix 27 


5'-HO-Ecof?/-primer 


a'-HO-SpM-primer 


5'-HO-ApaL /-primer 


5' -biotin- Ecofl/- primer 


1 


2 


3 


5'-biotin-Bp///-primer 


4 


5 


6 


S'-biotin-zApa/./- primer 


7 


8 


9 



5 



matrix 28 


5'-HO-£cofl/-primer 


5'- HO- Xba /-primer 


5*-HO-Acc65/-primer 


5'-biotin-Ecofl/-primer 


1 


2 


3 


5'-biotin-Xba/-primer 


4 


5 


6 


S'-biotin-AccGS/'primer 


7 


8 


9 


matrix 29 


5*-HO-Ecofl/-primer 


5'-HO-Xfca/-primer 


5*- HCMpa/./- primer 


5'-biotin-£co/?Aprimer 


1 


2 


3 


5'-biotin-Xi>aAprimer 


4 


5 


6 


5'-btotin-/\pa/_ /-primer 


7 


8 


9 



matrix 30 


5'-HO-£cofl/-primer 


5'-HO-y4cc65/-primer 


5'-HO-/\paL/-primer 


S'-biotin-Ecoffl-primer 


1 


2 


3 


5'-biotin-/Acc65/-primer 


4 


5 


6 


5'-biotin->ApaL/-primer 


7 


8 


9 
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matrix 31 


5'-HO-SspH/-primer 


5*-HO-B£///-primer 


5'-HO-X£a/-primer 


5 '-biotin-Ssptf/- primer 


1 


2 


3 


5'-biotin-Bg///-primer 


4 


5 


-6 


5-biotin-Xba/-primer 


7 


8 


9 



matrix 32 


5"-HO-SspH/*primer 


5'-HOSp///-primer 


5'-HCMcc65/-primer 


5'-biotin-0spH/-primer 


1 


2 


3 


5"-biotin-Sg///-primer 


4 


5 


6 


5*-biotin->4cc65/-primer 


7 


8 . 


9 



matrix 33 


5'-HO-SspH/-primer 


o"- HO- Bg///-primer 


5'- HQ-ApaL /-primer 


5'-biotin-8spH/-primer 


1 


2 


3 


5'-biotin-Bg///-primer 


4 


5 


6 


5'-biotin-/4pa/_/-primer 


7 


8 


9 



matrix 34 


5'-HOBspH/-primer 


5-HO-Xba/-primer 


5'-HO-v4cc65/-primer 


5'-biotin-8spH/-primer 


1 


2 


3 


5'-btotin-XbaAprimer 


4 


5 


6 


5'-biotin-y4cc65/-primer 


7 


8 


9 



matrix 35 


5 ' -HO- BspH I- primer 


5-HO-X6a/-primer 


5'-HCMpaL/-primer 


5'-biotin-BspH/-primer 


1 


2 


3 


5 T -biotin-Xba/-primer 


4 


5 


6 


5'-biotin-/4pa/J-primer 


7 


8 


9 



5 



matrix 36 


5'-HO-Ssp/-//-primer 


5'-HO-/lcc65/-primer 


5'-HO-y4paL/-primer 


5'-biotin-BspH/-primer 


1 


2 


3 


5'-biotin->Acc65/-primer 


4 


5 


6 


S'-biotirMpaZ. /-primer 


7 


8 


9 



matrix 37 


5'-HO-8g///-primer 


5'-HO-X£>a/-primer 


5*-HO-/4cc65Aprimer 


5'-biotin-ep///-primer 


1 


2 


3 


5'-biotin-X6a/-primer 


4 


5 


6 


S'-biotin-yAccSSZ-primer 


7 


8 


9 



matrix 38 


5'-HO-ep///-primer 


5-HO-X£?a/-primer 


5'- HO-ApaU- primer 


5'-biotin- Bg///-primer 


1 


2 


3 


5'-biotin-X6a/-primer 


4 


5 


6 


5*-biotin-/\paL /-primer 


7 


8 


9 
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matrix 39 


5'-HO-Sg//Apnmer 


5*- HO-/Acc65/-primer 


5- HO-ApaL /-primer 


5-biotin-fip///-primer 


1 


2 


3 


5'-biotin-4cc65/-primer 


4 


5 


6 


5*-biotin-y4pa/_ /-primer 


7 


8 


9 



matrix 40 


5'-H0-XfcaAprimer 


5'-HO->4cc65Aprimer 


5- HO-ApaL /-primer 


5' -biotin-Xfca /-primer 


1 


2 


3 


5-biotin--Acc65/-primer 


4 


5 


6 


S'-biotin-ApaZ-Aprimer 


7 


8 


9 



Example TRSPA-2 test matrices for set 1 - BamHI, BsrGI, Hindlll, Ncol, 
Spel and Aflll 



matrix 1 


5'-HO- SamtfAprimer 


5-HO-BsrGAprimer 


5-HO-H/ncf//Aprimer 


5*-HO-SamH/-primer 


1 


2 


3 


5'-HO-BsrGAprimer 


4 


5 


6 


5'-HO-H/nc///Aprimer 


7 


8 


9 



matrix 2 


S'-HO-SamHAprimer 


5'-HO-S$/-G/-primer 


S'-HO-A/coAprimer 


5'-HO-0amHAprimer 


1 


2 


3 


S'-HO-BsrGAprimer 


4 


5 


6 


5'-HO-/VcoAprimer 


7 


8 


9 



matrix 3 


5' -HO* BamHI- primer 


S'-HO-BsrGAprimef 


5"-HO-SpeAprimer 


S'-HO-BamHAprimer 


1 


2 


3 


S'-HO-BsrGAprimer 


4 


5 


6 


5*-HO-SpeAprimer 


7 


8 


9 



matrix 4 


S'-HO-BamHAprimef 


5*-HO-6srGAprimer 


S'-HO-Afl/Aprimer 


S'-HO-BamHAprimer 


1 


2 


3 


S'-HO-BsrGAprimer 


4 


5 


6 


5*-HO-/W/Aprimer 


7 


8 


9 



matrix 5 


S'-HO-SamHAprimer 


5'-HO-H/nd//Aprimer 


5'-HO-/VcoAprimer 


S'-HO-BamHAprimer 


1 


2 


3 


S'-HO-H/nd/ZAprinier 


4 


5 


6 


5*-HO-A/coAprimer 


7 


8 


9 



10 
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matrix 6 


5'-HO-SamH/-primer 


5 " - H 0 - Hindlll- p ri m e r 


5'-HO-Spe/-primer 


5'-HO-SamH/-primer 


1 


2 


3 


5'-HO-H/ntf///-primer 


4 


5 


6 


S'-HO-Spe/'primer 


7 


8 


9 



matrix 7 


5'-HOSamH/-primer 


5-HO-H/ntf///-primer 


5-HO-/A////-primer 


5' -HO- BamHI- primer 


1 


2 


3 


5 , -HO-H/V7d///-primer 


4 


5 


6 


5'-HO-4////-primer 


7 


8 


9 



matrix 8 


5'-HO-BamH/-primer 


5'-HO-/Vco/-primer 


5-HO-Spe/-primer 


b* -HO- BamHI-p rimer 


1 


2 


3 


5'-HO-/Vco/-primer 


4 


5 


6 


5"-HO-Spe/-primer 


7 


8 


9 



matrix 9 


5'-HO-0amH/-primer 


5-HO-A/co/-primer 


S'-HCMMAprimer 


5'-HO-SamH/-primer 


1 


2 


3 


5'-HO-A/co/-primer 


4 


5 


6 


5'-HCM////-primer 


7 


8 


9 



matrix 10 


5 , -HO-Sam7-//-primer 


5'-HO-Spe/-primer 


5*-HO-/tf///-primer 


5'-HO-BamH/-primer 


1 


2 


3 


5'-HO-Spe/-primer 


4 


5 


6 


5'-HO/tf///-primer 


7 


8 


9 



5 



matrix 1 1 


5'-HO-SsrG/-primer 


5-HO-H/'ncW/-primer 


5'-HO-A/co/-primer 


5'-HO-0srt3/-primer 


1 


2 


3 


5'-HO-H/nd///-primer 


4 


5 


6 


5*-HO-A/co/-primer 


7 


8 


9 



matrix 12 


5'-HO-BsrG/-primer 


5 , *HO-H/'ncf///-primer 


5'-HO-Spe/-primer 


5'-HO-BsrG/-primer 


1 


2 


3 


5'-HO-H/nd///-primer 


4 


5 


6 


5*-HO-Spe/-primer 


7 


8 


9 



matrix 13 


5 T -HO-SsrG/-primer 


5-HO-H/nd///-primer 


5'-HO-/tf///-primer 


S'-HO-Bsz-GAprimer 


1 


2 


3 


5'-HO-H/nd///-primer 


4 


5 


6 


5'-HO-/W/-primer 


7 


8 


9 
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matrix 14 


5'-HO-BsrG/-primer 


5'-HO-A/co/-primer 


5*-HO-Spe/-primer 


5*-HO-esrG/-primer 


1 


2 


3 


5'-HO-/Vco/-primer 


4 


5 


6 


5'-HO-Spe/-primer 


7 


8 


. 9 



matrix 15 


5-HO-SsrG/-primer 


5'-HO-A/co/-primer 


5'-HO-A/7//-primer 


5'-HO-BsrG/-primer 


1 


2 


3 


5'-HO-/Vco/-primer 


4 


5 


6 


5'-HO-/W-primer 


7 


8 


9 



matrix 16 


5'-HO- 8srG/-primer 


5'-HO-Spe/-primer 


5'-HO-/tf///-primer 


5'-HO-SsrG/-primer 


1 


2 


3 


5-HO-Spe/-primer 


4 


* 5 


6 


5-HO-Ar7//-primer 


7 


8 


9 



matrix 17 


5*-HO-H/nc(///-primer 


5'- HO- /Vco/- primer 


5-HO-Spe/-primer 


5'-HO-H/no7//-primer 


1 


2 


3 


5'-HO-A/co/-primer 


4 


5 


6 


5*-HO-Spe/-primer 


7 


8 


9 



matrix 18 


5'-HO-H/otf///-primer 


5'- HO- /Vco/- primer 


5'-HO-/tf///-primer 


5'-HO-H/nd///-primer 


1 


2 


3 


5*HO-A/co/-primer 


4 


5 


6 


5*-HO-y4tf//-primer 


7 


8 


9 



5 



matrix 19 


5'-HO-H/>7c////-primer 


5'-HO-Spe/-primer 


5'-HO-/\////-primer 


5 , -HO-Hlnd///-primer 


1 


2 


3 


5'*HO-Spe/-primer 


4 


5 


6 


5'-HO-yA////-primer 


7 


8 


9 



matrix 20 


5'-HO-A/co/-primer 


S'-HO-SpeZ-primer 


5'-HO-/W//-primer 


5'-HO-/Vco/-primer 


1 


2 


3 


5'-HO-Spe/-primer 


4 


5 


6 


5'-HO-/M///-primer 


7 


8 


9 
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Example TRSPA-2 test matrices for set 2 - EcoRL BspHI, Bglli, Xbal, 
Acc65l and ApaLI 



matrix 21 


5'-HO-Ecofl/-primer 


5'-HO-£JspH/-primer 


5"-HO-Sg///-primer 


5"-HO-Ecofl/-primer 


1 


2 


3 


5 : -HO-6spH/-primer 


4 


5 


6 


5*-HO-Sp///-pnmer 


7 


8 


9 



matrix 22 


5'-HO-Ecof?/-primer 


5'-HO-BspH/-primer 


5"-HO-Xfc>a/-primer 


o'-HO-EcoflAprimer 


1 


2 


3 


5'-HO-BspH/-primer 


4 


5 


6 


5'-HO-Xba /-primer 


7 


8 


9 



^1 



matrix 23 


5'-HO-Ecofl/-primer 


5 MO-BspHl- primer 


B'-HCMcceSZ-primer 


5'-HO-Ecofl/-primer 


1 


2 


3 


5'-HO-8spH/-primer 


4 


5 


6 


5'-HO-4cc65/-primer 


7 


8 


9 



matrix 24 


S'-HO-EcofiZ-primer 


5'-HO-0spH/-primer 


S'-HO-ApaL /-primer 


5*-HO-Ecofl/-primer 


1 


2 


3 


5'-HO-SspH/-primer 


4 


5 


6 


5'-HO-/*pa/./-primer 


7 


8 


9 



matrix 25 


5'- HO Ecof?/- primer 


5'-HO-eg///-primer 


5'-HO-Xba/-primer 


5-HO-£cofl/-primer 


1 


2 


3 


S'-HO-Sp/y/'primer 


4 


5 


6 


5'-HO-Xba/-primer 


7 


8 


9 



matrix 26 


5'-HO-Ecofl/-primer 


5'-HO-8g///-primer 


5'-HCMcc65/-primer 


5'-HO-Ecofl/-primer 


1 


2 


3 


5'-HO-8g//Aprimer 


4 


5 


6 


5'-HO/\cc65/-primer 


7 


8 


9 



matrix 27 


5'-HO-Ecofl/-primer 


5'-HO-Bp//Aprimer 


S'-HCMpaU-primer 


5'-HO-£cofl/-primer 


1 


2 


3 


S'-HO-epZ/Z-primer 


4 


5 


6 


5*-HO/4pa/./-primer 


7 


8 


9 



10 
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matrix 28 


5-HO-EcoR/-primer 


5'-HO-X£>a/-primer 


5'-HO-yAcc65/-primer 


5'-HOEcofl/-primer 


1 


2 


3 


5'-HO-Xba/-primer 


4 


5 


6 


5'-HO-Acc65/-primer 


7 


8 


9 



matrix 29 


5-HO-Ecofl/-primer 


5'-HOXba/-primer 


5'-HO-/4pa/J-primer 


5'-HO-£cofl/-primer 


1 


2 


3 


5-HO-X£?a/-primer 


4 


5 


6 


5'-HO-ApaL/-primer 


7 


8 


9 



matrix 30 


5'-HO-Ecofl/-primer 


5'-HO-Acct>5/-primer 


5-HO-Apa*_/-primer 


5'-HO-Ecoft/-primer 


1 


2 


3 


5'-HO-'Acc65/-primer 


4 


5 


6 


S'-HO-Apa/J-primer 


7 


8 


9 



matrix 31 


5'-HO*6spH/-primer 


5'-HO-Bs///-primer 


5*- HO-X£?a/- primer 


5'-HO-BspH/-primer 


1 


2 


3 


5'-HO-Bp///-primer 


4 


5 


6 


5'-HO-X6a/-primer 


7 


8 


9 



matrix 32 


S'-HO-SspHAprimer 


5*-HO-Bp///-primer 


5'-HCMcc65/-primer 


5'-HOBspH/-primer 


1 


2 


3 


5'-HO-Sg///-primer 


4 


5 


6 


5'-HO-Acc65/-primer 


7 


8 


9 



5 



matrix 33 


5'-HO-BspH/-primer 


5'-HO-Bp///-primer 


5'-HO-Apa/-/-primer 


5'-HO-BspH/-primer 


1 


2 


3 


5*-HO-Bg///-primer 


4 


5 


6 


5'-HO-ApaL/-primer 


7 


8 


9 



matrix 34 


5'-HO-BspH/-primer 


5'-HO-X/?a/-primer 


5'-HO-Acc65/-primer 


5'-HO-BspH/-primer 


1 


2 


3 


5'-HO-Xba/-primer 


4 


5 


6 


5'- HO- Acc65l- primer 


7 


8 


9 



matrix 35 


5'-HO-BspHAprimer 


5'-HO-X6a/-primer 


5'-HO-Apa/./-primer 


5*- HO- BspHI-p rimer 


1 


2 


3 


5'-HO-Xfca/-primer 


4 


5 


6 


5'-HO-ApaL/-primer 


7 


8 


9 
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matrix 36 


5'-riO-SspH/-pnmer 


5'-HO-/4cc65/-primer 


5'- HO-ApaLI- primer 


5*-HO-8spH/-primer 


1 


2 


3 


5"-HO-/Acc65/-primer 


4 


5 


6 


5'-HO-/4pa/_/-primer 


7 


8 


9 



matrix 37 


5"-HO-Bp///-primer 


5- HO- Xbaf- primer 


5"-HO-/4cc65/-primer 


5-HO-Bp///-primer 


1 


2 


3 


5'-HO-X£?a/-primer 


4 


5 


6 


5-HO-/4cc65/-primer 


7 


8 


9 



matrix 38 


5'-HO-Bc?///-primer 


5- HO-X/?a /-primer 


5'-HO-v4pa/_ /-primer 


5-HO-Bg///-primer 


1 


2 


3 


5"-HO-Xba/-primer 


4 


5 


6 


o-HO-Apa /./-primer 


7 


8 


9 



matrix 39 


S'-HO-Bp/ZAprimer 


5'-HO->4cc65/-primer 


5'-HO-/4pa/_/-primer 


5'-HO-0g///-primer 


1 


2 


3 


5'- HO-Acc65I- primer 


4 


5 


6 


5'-HO/lpaU-primer 


7 


8 


9 



matrix 40 


5'- HO -X6a /-primer 


5'- HO-/\cc65/- primer 


S'-HCMpaU-primer 


5'-HO-Xfc>a/-primer 


1 


2 


3 


5'-HO-v4cc65/-primer 


4 


5 


6 


S'-HO- ApaL /-primer 


7 


8 


9 



5 

Hybridisation patterns 

There are six possible hybridisation patterns for a given probe 
fragment from inter-population perfectly matched duplex depletion 
enrichment with a (3x3) test matrix in TRSPA-1 analysis. 
io There are also six possible hybridisation patterns for a given 

probe fragment from inter-population perfectly matched duplex depletion 
enrichment with a (3x3) test matrix in TRSPA-2 analysis. 

We can denote these patterns as follows 



15 
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TRSPA-1 analysis 



Pattern a (1 only) 



matrix # 


5'-HO-X-primer 


5'-HO-Y-primer 


5-HO-Z-primer 


S'-biotin-X-primer 






2 


3 


5*-biotin-Y-primer 


4 


5 


6 


S'-btotin-Z-primer 


7 


8 


9 


Pattern b (2 and 4) 


matrix # 




5-HO-X-primer 


5*-HO-Y-primer 


S'-HO-Z-primer 


5-biotin-X-primer 




1 




3 


5-biotin-Y-primer 






5 


6 


5-biotin-Z-primer 




7 


8 


9 



Pattern c (3 and 7) 



matrix # 


5-HO-X-primer 


S'-HO-Y-primer 


5 : -HO-Z-primer 


S'-biotin-X-primer 


1 


2 




S'-biotin-Y-primer 


4 


5 


6 


5'-biotin-Z-primer 




8 


9 


Pattern d (5 only) 


matrix # 


S'-HO-X-primer 


5-HO-Y-primer 


S'-HO-Z-primer 


5-biotin-X-primer 


1 


2 


3 


S'-biotin-Y-primer 


4 




6 


5'-biotin-Z-primer 


7 


8 


9 



15 
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Pattern e (6 and 8) 



matrix * 


5-HO-X-primer 


5'-HO-Y-primer 


5-HO-Z-primer 


5'-biotin-X-primer 


1 


2 


3 


5-biotin-Y-primer 


4 


5 




5-biotin-Z-primer 


7 


^ 9 


Pattern f (9 only) 


matrix # 


5 ; -HO-X-primer 


5-HO- Y-primer 


S'-HO-Z-primer 


5"-biotin-X-primer 


1 


2 


3 


5-biotin-Y-primer 


4 


5 


6 


5'-biotii>Z-pnmer 


7 


8 





TRSPA-2 analysis 



Pattern a (row 1 and column 1) 



matrix # 


5*-HO-X-primer S'-HO-Y-primer S'-HO-Z-primer 


5'-HO-X-primer 
5'-HO-Y-primer 




5-HO-Z-primer 


MjfM 8 9 



Pattern b (2 and 4) 



matrix # 


S'-HO-X-primer 


5'-HO- Y-primer 


S'-HO-Z-primer 


5 - HO -X-primer 


1 




3 


5'- HO -Y-primer 




5 


6 


5'- HO -Z-primer 


7 


8 


9 
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Pattern c (3 and 7) 



matrix # 


5'-HO-X-primer 


5'-HO-Y-primer 


5'-HO-Z-primer 


5 r - 


HO -X-primer 


1 


2 




5'- 


HO -Y-primer 


4 


5 


6 


5'- 


HO -Z-p rimer 




8 


9 



Pattern d (row 2 and column 2) 



5 



matrix # 


S'-HO-X-primer 


S'-HO-Y-primer 


5'-HO-Z-primer 


5'- HO -X-primer 




5'- HO -Y-primer 


5'- HO -Z-primer 



Pattern e (6 and 8) 



matrix # 


5'-HO-X-primer 


5'-HO-Y-primer 


S'-HO-Z-primer 


5'- HO -X-primer 


1 


2 


3 


5'- HO -Y-primer 


4 


5 




5'- HO -Z-primer 


7 







io Pattern f (row 3 and column 3) 



matrix # 



5'- HO -X-primer 



5'- HO -Y-primer 



5'- HO -Z-primer 



5'-HO-X-primer 



1 



S'-HO-Y-primer 



S'-HO-Z-primer 



15 



Signatures 

If we have 40 such (3x3) test matrices and we denote the 
possible hybridisation patterns a, b, c, d, e and f, then we can write the 
overall TRSPA hybridisation signature as follows 
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Matrix 


Pattern 


matrix 


pattern 


1 


a. b, c. d. e or f 


21 


a, b, c, d. e or f 


2 


a. b, c, d. e or f 


22 


a, b, c. d, e or f 


3 


a. b, c, d, e or f 


23 


a, b, c, d, e or f 


4 


a. b, c, d. 6 or f 


24 


a. b, c, d, e or f 


5 


a h c d eorf 


25 


a, b, c, d, e or f 


6 


a, b, c d, e or f 


26 


a, b, c, d, e or f 


7 


a h r d e or f 


27 


a, b, c, d, e or f 


8 


a h r ri p nr f 

U| L^f i*J T v Ul 1 


28 


a b c d e or f 


9 


a b c d 6 or f 


29 


a, b, c d. e or f 


10 


s b c d eorf 

« U> < Vf Vf w vl 1 


30 


a, b, c, d. e or f 


11 


a, b, c. d. e or f 


31 


a, b. c ( d, e or f 


12 


a, b, c, d, e or f 


32 


a, b, c, d, e or f 


13 


a, b, c, d, e or f 


33 


a. b, c, d, e or f 


14 


a, b, c, d, e or f 


34 


a, b, c, d, e orf 


15 


a, b, c, d, e or f 


35 


a, b, c, d, e or f 


16 


a, b, c, d, e or f 


36 


a, b ( c, d, e or f 


17 


a, b, c, d, e or f 


37 


a, b, c, d, e or f 


18 


a, b, c, d, e or f 


38 


a, b, c, d, eorf 


19 


a, b. c, d, e or f 


39 


a, b, c, d, e orf 


20 


a. b. c. d, e or f 


40 


a. b, c, d, e or f 



Aspect (III) 

In another aspect the invention provides a nucleic acid 
5 characterisation method which comprises presenting to the set of arrays as 
defined above a nucleic acid fragment of interest under hybridisation 
conditions, and observing a pattern of hybridisation. Preferably, a plurality 
of nucleic acid fragments of interest are separately presented to the set of 
arrays, and the resulting patterns of hybridisation are compared. 
io Preferably, the plurality of nucleic acid fragments of interest are drawn from 
the mixture of DNA fragments, enriched in fragments that are characteristic 
of a phenotype of interest, as described under the invention (1) above. 

Thus in this aspect the invention provides a method of 
identifying fragments of DNA that are characteristic of a phenotype of 
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interest, which method comprises recovering, cloning and amplifying 
individual DNA fragments from the mixture of DNA fragments obtained 
under invention (1) above, presenting the individual DNA fragments to the 
set of arrays as defined under hybridisation conditions, observing a pattern 
5 of hybridisation generated by each individual DNA fragment, and subjecting 
to further investigation any two or more individual DNA fragments whose 
hybridisation patterns are similar or identical. 

TRSPA signatures and whole genome association studies 

io After a given number of cycles of inter-population perfectly 

matched duplex depletion, phenotype determining fragments will be 
enriched but will not be entirely free from 'noise' fragments. Noise may 
result from unequal allelic frequencies for certain SNPs between the two 
populations. Noise will also result from the presence of somatic mutations 

15 in the cells used to prepare DNA fragments and from the use of 

polymerase chain reaction in some of the embodiments of the current 
invention. 

For the preferred embodiment, DNA is prepared from a library 
of clones (either genomic clones or cDNA clones) - with inserts derived 

:o from the individual(s) and propagated in some appropriate host and cloning 
vector system. Restriction enzyme fragmentation is used prior to cloning 
and polymerase chain reaction amplification is used to prepare the DNA for 
comparison in fragmented form. Priming sites within the vector sequence 
flanking the cloned restriction enzyme fragmented inserts are employed for 

25 one or more cycles of polymerase chain reaction amplification of the 
fragmented DNA of interest. The primers used for polymerase chain 
reaction amplification of the fragmented DNA of interest are again used 
after the phenotype-determining fragment enrichment process to 'rescue* 
and clone the enriched fragments. Cloned enriched fragments are colony 

30 purified, picked into appropriate storage containers, catalogued and 
archived, DNA probes are prepared from these single clones and are 
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individually hybridised against 180 spot TRSPA arrays as above. TRSPA 
signatures are determined for many colonies. Most noise fragments will be 
random in nature and will thus be randomly distributed amongst the 6 20 - 
3.7-1 0 15 possible types of TRSPA signature. Phenotype-determining signal 
5 fragments, however, will be those where repeat TRSPA signatures are 
obtained as more and more colonies are sampled. The more frequently the 
repetition of a given TRSPA signature occurs per unit colonies sampled, 
the greater the signal to noise ratio and the more successful has been the 
enrichment. 

it) A great many of the steps in this process are amenable to 

high throughput automation - enabling very large numbers of single colony 
TRSPA signatures to be determined with ease and extending the power of 
the current invention to cases where signal to noise ratios are beyond 
current approaches. 

15 Statistical correlations (associations) can initially be drawn 

between TRSPA signatures and phenotype. The clones giving rise to a 
particular TRSPA signature showing a useful association with a phenotype 
of interest can then be sequenced in order to determine at a DNA 
sequence level the association(s) with the phenotype of interest. Such 

:o associations have future predictive values for the phenotype of interest, 
knowing the genotype and will be of great value in medicine and 
pharmacogenetics. 

If the genome of interest is wholly or partially sequenced, we 
can also in silico restrict the DNA with all n enzymes, calculate the 

25 expected signature for each fragment and pattern match these expected 
signatures with the observed signature (taking into account any loss or gain 
of restriction sites due to polymorphic variation compared to the reference 
sequence) to immediately identify the fragment of interest within a gene, 
genomic region, chromosome or whole genome. This latter method will be 

30 of great value in those cases where a great many phenotype-determining 
fragments are obtained and repeat signatures are rare or unobtained. The 
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clustering of phenotype-determining fragments to adjacent DNA regions 
thus gives an association between those genomic regions and the 
phenotype of interest. 

5 Aspect (IV) 

In yet another aspect the invention provides a double- 
stranded DNA molecule having the sequence a-A-b-B...X-y-Y-z where A t 
B...X and Y are unique restriction sites for n different restriction 
endonuclease enzymes, and a, b...y, z denotes distances in base pairs, 
id characterised in that each fragment, obtainable by cutting the DNA 

molecule by means of any one or more up to n of the restriction enzymes, 
has a different length from every other fragment. 

An example totally diagnostic internal control DNA which allows both 
15 the extent and exact nature of any example set 1 (or example set 2) 6 
bp cutter partial digestion to be unambiguously determined for inter- 
population perfectly matched duplex depletion or TRSPA restriction 

In both of the above schemes, it is important that limit 
digestion products are obtained. Monitoring the extent of partial digestion 

20 resulting from multi-enzymatic restriction and determining precisely which 
enzymes have failed to cut is a task of great importance. 

If we have up to six enzymes for DNA digestion - let us label 
these A, B, C, D, E and F. We need to somehow determine that these 
have all cut to completion during the fragmentation stage for inter- 

25 population perfectly matched duplex depletion and also for the digestion 
step prior to adaptor ligation in TRSPA fabrication. If any of the enzymes 
have failed to cut to completion, we need to know which ones and to what 
degree in order to effectively rectify the problem. 

30 The structure of th internal control DNA 

If we construct a double stranded DNA molecule with the 
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following structure 

end—t— A— u— B— v--C— w— D— x— E— y— F— z— end 

5 where the A, B, C, D, E and F denote the sites for restriction 

enzyme cutting and t, u, v, w, x, y and z denote distances in base pairs. 

This internal control DNA is either uniformly pre-labelled and 
added to the DNA of interest at an appropriate concentration prior to 
restriction or is Southern blot probed with a complementary sequence not 
jo found in the DNA of interest after restriction. 

All six enzymes can cut in only one way. 
One enzyme can fail to cut in 6 d = 6 ways, these are: A, B, 
C, D, E or F failing to cut. 

Two enzymes can fail to cut in 6 C 2 = 15 ways, these are: AB, 
15 AC, AD, AE, AF, BC, BD, BE, BF, CD, CE, CF, DE, DF or EF failing to cut. 

Three enzymes can fail to cut in 6 C 3 = 20 ways, these are: 
ABC, ABD, ABE, ABF, ACD, ACE, ACF, ADE, ADF, AEF, BCD, BCE, BCF, 
BDE, BDF, BEF, CDE, CDF, CEF or DEF failing to cut. 

Four enzymes can fail to cut in 6 C 4 = 15 ways, these are: 
:o ABCD, ABCE, ABCF, ABDE, ABDF, ABEF, ACDE, ACDF, ACEF, ADEF, 
BCDE, BCDF, BCEF, BDEF or CDEF failing to cut. 

Five enzymes can fail to cut in 6 C 5 = 6 ways, these are: 
ABCDE, ABCDF, ABCEF, ABDEF, ACDEF or BCDEF failing to cut. 
All six enzymes can fail to cut in only one way. 
25 Each of the above possibilities will generate one or more 

fragments from the internal control DNA. If each possible fragment has a 
discernible size from any other, then we can determine exactly which 
enzymes have cut and which have not from the size distribution of the 
fragments generated. The task is therefore to design such a DNA 
30 molecule. 
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Example simulations 

Seven simulations are given below - varying the size of inter- 
site fragments. Criteria for a successful outcome include the following 

1 . The inter-fragment spacing should be greater for larger 
5 fragments (so as to aid electrophoretic resolution). 

2. All possible fragments should be unambiguously resolvable in 
size from each-other. 

3. Size gaps between bands comprising different numbers of 
inter-site units should be greater than the size gaps between bands 

iu comprising the same number of inter-site units. 

4. The size gaps and size spread from largest to smallest 
fragment should be electrophoretically compatible. 

Simulation 1 

I5 

Inter-site fragment sizes (in bp) 



End— A 


A---B 


B-C 


C— D 


D— E 


E---F 


F— end 


80 


90 


100 


110 


120 


130 


140 
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Possible digestion products obtained (in bp) 





Fragment length 
(in bp) 


size gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






end— A 


80 




A— B 


90 


10 


B---C 


100 


10 


C— D 


110 


10 


D— E 


120 


10 


E— F 


130 


10 


F— end 


140 


10 


two unit fragments 






end— B 


170 


30 


A— C 


190 


20 


B— D 


210 


20 


C— E 


230 


20 


D—F 


250 


20 


£.._end 


270 


20 


three unit fraaments 






end C 


270 


0 


A— D 


300 


30 


B— E 


330 


30 


C— F 


360 


30 


D — end 


390 


30 


four unit fragments 






end— D 


380 


-10 


A— E 


420 


40 


B— F 


460 


40 


C— end 


500 


40 


five unit fragments 






end— E 


500 


0 


A— F 


550 


50 


B— end 


600 


50 


six unit fragments 






end— F 


630 


30 


B— end 


690 


60 


seven unit fragments 






end — end 


770 


80 
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Spread = 690 bp 



Simulation 2 



Inter-site fragment sizes (in bp) 



end — A 


A-B 


B—C 


C---D 


D-E 


E— F 


F— end 


90 


100 


110 


120 


130 


140 


150 



Possible digestion products obtained (in bp) 
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fragment length 


size gap to the next 






(in bp) 


smallest fragment 
(in bp) 




one unit fragments 








end—A 


90 






A— B 


100 


10 




B-C 


110 


10 




C— D 


120 


10 




D— E 


130 


10 




E— F 


140 


10 




F— end 


150 


10 


• 3 


two unit fragments 








end— B 


190 


40 




A— C 


210 


20 


?==; 


B-D 


230 


20 


4= 


C-E 


250 


20 




D — F 


270 


20 




E— end 


290 


20 




three unit fragments 








end— C 


300 


10 


ssaas 


A— D 


330 


30 




B— E 


360 


30 


.=5. 


C— F 


390 


30 




D— end 


420 


30 




four unit fragments 








end— D 


420 


0 




A— E 


460 


40 




B— F 


500 


40 




C— end 


540 


40 




five unit fragments 








end— E 


550 


10 




A— F 


600 


50 




B— end 


650 


50 




six unit fragments 








end— F 


690 


40 




B— end 


750 


60 




seven unit fragments 








end — end 


840 


90 



Spread = 750 bp 
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Simulation 3 

Inter-site fragment sizes (in bp) 



end—A 


A— B 


B-C 


C-D 


D--E 


E— F 


F— end 


100 


110 


120 


130 


140 


150 


- 160 



5 

Possible digestion products obtained (in bp) 
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fragment length 
(in bp) 


size gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






end—A 


100 




A— B 


110 


10 


B—C 


120 


10 


C— D 


130 


10 


D— E 


140 


10 


E — F 


150 


10 


F— end 


160 


10 


two unit fragments 






end— B 


210 


50 


A— C 


230 


20 


B—D 


250 


20 


C— E 


270 


20 


D — F 


290 


20 


E— end 


310 


20 


three unit fragments 






end— C 


330 


20 


A— D 


360 


30 


B-E 


390 


30 


C— F 


420 


30 


D— end 


450 


30 


four unit fragments 






end— D 


460 


10 


A— E 


500 


40 


B— F 


540 


40 


C— end 


580 


40 


five unit fragments 






end— E 


600 


20 


A— F 


650 


50 


B— end 


700 


50 


six unit fragments 






end— F 


750 


50 


B — end 


810 


60 


seven unit fragments 






end — end 


910 


100 



Spread = 810 bp 
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Simulation 4 

Inter-site fragment sizes (in bp) 



end—A 


A— B 


B— C 


C-D 


D—E 


E-F 


F— end 


110 


120 


130 


140 


150 


160 


170 



5 

Possible digestion products obtained (in bp) 
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fragment length 
(in bp) 


size gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






end— A 


110 




A— B 


120 


10 


B---C 


130 


10 


C— D 


140 


10 


D-E 


150 


10 


E — F 


160 


10 


F— end 


170 


10 


two unit fragments 






end— B 


230 


60 


A~C 


250 


20 


B~D 


270 


20 


C-E 


290 


20 


D— F 


310 


20 


E— end 


330 


20 


three unit fragments 






end—C 


360 


30 


A— D 


390 


30 


B— E 


420 


30 


C— F 


450 


30 


D— end 


480 


30 


four unit fragments 






end— D 


500 


20 


A---E 


540 


40 


B— F . 


580 


40 


C-»end 


620 


40 


five unit fragments 






end — E 


650 


30 


A— F 


700 


50 


B— end 


750 


50 


six unit fragments 






end — F 


810 


60 


B— end 


870 


60 


seven unit fragments 






end— end 


980 


110 



Spread = 870 bp 
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Simulation 5 

Inter-site fragment sizes (in bp) 



end—A 


A— B 


B-C 


C---D 


D— E 


E— F 


F — end 


120 


130 


140 


150 


160 


170 


180 



Possible digestion products obtained (in bp) 
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fragment length 
(in bp) 


size gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






end — A 


120 




A— B 


130 


10 


B—C 


140 


10 


C-D 


150 


10 


D— E 


160 


10 


E — F 


170 


10 


F— end 


180 


10 


two unit fragments 






end—B 


250 


70 


A— C 


270 


20 


B— D 


290 


20 


C— E 


310 


20 


D— F 


330 


20 


E—end 


350 


20 


three unit fragments 






end~-C 


390 


40 


A--D 


420 


30 


B— E 


450 


30 


C-F 


480 


30 


D— end 


510 


30 


four unit fragments 






end— D 


540 


30 


A— E 


580 


40 


B--F 


620 


40 


C— end 


660 


40 


five unit fragments 






end— E 


700 


40 


A--F 


750 


50 


B— end 


800 


50 


six unit fragments 






end— F 


870 


70 


B— end 


930 


60 


seven unit fragments 






end—end 


1050 


120 



Spread = 930 bp 
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Simulation 6 

Inter-site fragment sizes (in bp) 



end — A 


A— B 


B— C 


C---D 


D— E 


E— F 


F— end 


130 


140 


150 


160 


170 


180 


190 



5 

Possible digestion products obtained (in bp) 
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fragment length 
(in bp) 


size gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






end—A 


130 




A---B 


140 


10 


B-C 


150 


10 


C— D 


160 


10 


D— E 


170 


10 


E-F 


180 


10 


F— end 


190 


10 


two unit fragments 






end— B 


270 


80 


A-C 


290 


20 


B— D 


310 


20 


C— E 


330 


20 


D---F 


350 


20 


E— end 


370 


20 


three unit fragments 






end— C 


420 


50 


A— D 


450 


30 


B— E 


480 


30 


C— F 


510 


30 


D— end 


540 


30 


four unit fragments 






end— D 


580 


40 


A-E 


620 


40 


B— F 


660 


40 


C— end 


700 


40 


five unit fragments 






end— E 


750 


50 


A-F 


800 


50 


B— end 


850 


50 


six unit fragments 






end— F 


930 


80 


B— end 


990 


60 


seven unit fragments 






end— end 


1120 


130 



Spread = 990 bp 
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Simulation 7 

Inter-site fragment sizes (in bp) 



end— A 


A--B 


B-C 


C-D 


D-E 


E-F 


F— end 


140 


150 


160 


170 


180 


190 


. 200 



Possible digestion products obtained (in bp) 
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fragment length 


size gap to the next 






(in bp) 


smallest fragment 
(in bp) 




one unit fragments 








end— A 


140 






A— B 


150 


10 




B— C 


160 


10 




C— D 


170 


10 




D— E 


180 


10 




E— F 


190 


10 




F— end 


200 


10 


; 5 


two unit fragments 






. f% 


end— B 


290 


90 










• iT^ 


A— C 


310 


20 




B— D 


330 


20 




C~E 


350 


20 












D— F 


370 


20 


s 


E— end 


390 


20 




three unit fragments 








end—C 


450 


60 




A--D 


480 


30 




B---E 


510 


30 




C-F 


540 . 


30 




D— end 


570 


30 




four unit fragments 








end— D 


620 


50 




A— E 


660 


40 




B— F 


700 


40 




C— end 


740 


40 




five unit fragments 








end— E 


800 


60, 




A— F 


850 


50 




B— end 


900 


50 




six unit fragments 








end— F 


990 


90 




B—end 


1050 


60 




seven unit fragments 








end — end 


1190 


140 



Spread = 1050 bp 
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According to the above criteria for success, simulation 7 
above clearly fulfils all of the requirements. 

An example totally diagnostic internal control DNA which allows both 
5 the extent and exact nature of any example set 1 (or example set 2) 4 
bp cutter partial digestion to be unambiguously determined for inter- 
population perfectly matched duplex depletion 

If we have up to three enzymes for DNA digestion - let us 
label these A, B and C. We need to somehow determine that these have 
io all cut to completion during the fragmentation stage for inter-population 
* perfectly matched duplex depletion. If any of the enzymes have failed to 
cut to completion, we need to know which ones and to what degree in order 
to effectively rectify the problem. 

I5 The structure of the internal control DNA 

If we construct a double stranded DNA molecule with the 
following structure 

end— t— A— u— B— v— C— w— end 

20 

where the A, B and C denote the sites for restriction enzyme 
cutting and t, u, v and w denote distances in base pairs. 

This internal control DNA is uniformly pre-labelled and added 
to the DNA of interest at an appropriate concentration prior to restriction or 
25 is Southern blot probed with a complementary sequence not found in the 
DNA of interest after restriction. 

All three enzymes can cut in only one way. v 

One enzyme can fail to cut in 3 d = 3 ways, these are: A, B or 
C failing to cut. 

30 Two enzymes can fail to cut in 3 C 2 = 3 ways, these are: AB, 

AC or BC failing to cut. 
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All three enzymes can fail to cut in only one way. 

Each of the above possibilities will generate one or more 
fragments from the internal control DNA. If each possible fragment has a 
discernible size from any other (and from any of the fragments in simulation 
? 7 above for the panel of up to 6 enzymes), then we can determine exactly 
which enzymes have cut and which have not from the size distribution of 
the fragments generated. The task is therefore to design such a DNA 
molecule. '. 

10 Example simulations 

Six simulations are given below - varying the size of inter-site 
fragments. Criteria for a successful outcome included the following: 
1 The inter-fragment spacing should be greater for larger 

fragments (so as to aid electrophoretic resolution), 
is 2. All possible fragments should be unambiguously resolvable in 

size from each-other. 

3 Size gaps between bands comprising different numbers of 

inter-site units should ideally be greater than the size gaps between bands 
comprising the same number of inter-site units. 
20 4. The size gaps and size spread from largest to smallest 

fragment should be electrophoretically compatible. 
5 The largest fragment obtained should ideally be smaller than 

the smallest fragment obtained in simulation 7 above for the panel of up to 
six enzymes. 



25 
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Simulation 1 

Inter-site fragment sizes (in bp) 



end— A 


A— B 


B~C 


C— end 


10 


20 


30 


40 



5 

Possible digestion products obtained (in bp) 





fragment length 
(in bp) 


size gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






end— A 


10 




A— B 


20 


10 


B — C 


30 


10 


C— end 


40 


10 


two unit fragments 






end— B 


30 


-10 


A— C 


50 


20 


B— end 


70 


20 


three unit fragments 






end— C 


60 


-10 


A—end 


90 


30 


four unit fragments 






end — end 


100 


10 



Spread = 90 bp 

10 

Simulation 2 

Inter-site fragment sizes (in bp) 



end— A 


A— B 


B---C 


C—end 


15 


25 


35 


45 
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Possible digestion products obtained (in bp) 





fragment length 
(in bp) 


size gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






end— A 


1 5 




A---B 


25 


10 


D — O 


JO 


10 


C— end 


45 


10 


two unit fragments 






end— B 


40 


-5 


A— C 


60 


20 


B — end 


80 


20 


three unit fragments 






end— C 


75 


-5 


A — end 


105 


30 


four unit fragments 






end—end 


120 


15 



Spread = 105 bp 

5 

Simulation 3 

Inter-site fragment sizes (in bp) 



end— A 


A— B 


B-C 


C— end 


20 


30 


40 


50 
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Possible digestion products obtained (in bp) 





fragment length 
(in bp) 


size gap to the next 
smallest fraqment 
(in bp) 


one unit fragments 






end— A 


20 




A— B 


30 


10 


B— C 


40 


10 


C— end 


50 


10 


two unit fragments 






end— B 


50 


0 


A— C 


70 


20 


B — end 


90 


20 


three unit fragments 






end— C 


90 


0 


A — end 


120 


30 


four unit fragments 






end—end 


140 


20 



Spread = 120 bp 

5 

Simulation 4 

Inter-site fragment sizes (in bp) 



end— A 


A— B 


B-C 


C— end 


25 


35 


45 


55 
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Possible digestion products obtained (in bp) 





fragment length 
(in bp) 


size gap to the next 
smallest fragment 
■ (in bp) 


one unit fragments 






end — A 


25 




A— B 


35 


10 


B— C 


45 


10 


C— end 


55 


10 


two unit fragments 






end— B 


60 


5 


A— C 


80 


20. 


3— end 


100 


20 


three unit fragments 






enq—C 


105 


5 


A — end 


135 


30 


four unit fragments 






end—end 


160 


25 - 



Spread = 135 bp 

5 

Simulation 5 

Inter-site fragment sizes (in bp) 



end— A 


A— B 


B— C 


C— end 


20 


25 


30 


35 
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Possible digestion products obtained (in bp) 





iraymeni lenyin 

fin hn\ 


ci7o nan tn tho novt 
bl£c yap IU ll l c McAi 

^mailp^t fraoment 
fin dd) 


Ann ■ iriii Ironmontc 

one unn iioymciuo 






6nd"~A 


20 




A Q 




5 


B---C 


30 


5 


C--end 


35 


5 


two unit fragments 






end— B 


45 


10 - 


A--C 


55 


10 


B— end 


65 


10 


three unit fragments 






end— C 


75 


10 


A — end 


90 


15- 


four unit fragments 






end— end 


110 


20 



Spread = 90 bp 

5 

Simulation 6 

Inter-site fragment sizes (in bp) 



end — A 


A— B 


B~C 


C— end 


25 


30 


35 


40 



10 
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Possible digestion products obtained (in bp) 





fragment length 
tin bp) 


size gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






end— A 


25 




A— B 


30 


5 


B-C 


35 


5 


C— end 


40 


5 


two unit fragments 






end— B 


55 


15 


A-C 


65 


10 


B— end 


75 


10 


three unit fragments 






end— C 


90 


15 


A — end 


105 


15 


four unit fragments 






end— end 


130 


25 



Spread = 1 05 bp 

5 

According to the above criteria for success, simulation 6 
clearly fulfils ail of the requirements. 

Example determination of the entire set of internal control DNA limit 
io and partial digestion patterns for a panel of up to six restriction 
enzymes 

For the example simulation 7, the entire set of internal control 
DNA limit and partial digestion patterns for a panel of up to six restriction 
enzymes can be determined as below. 



15 
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Inter-site fragment sizes (in bp) 



end— A 


A— B 


B-C 


C-D 


D-E 


E-F 


F— end 


140 


150 


160 


170 


180 


190 


200 



Possible digestion products obtained 

5 All six enzymes can cut in only one way. 



failed digests 


bp 


bp 


bp 


bp 


bp 


bp 


bp 


none 


140 


150 


160 


170 


180 


190 


200 



One enzyme can fail to cut in Ci = 6 ways, these are: A, B, 
C, D, E or F failing to cut. 

10 



failed digest 


bp 


bp 


bp . 


bp 


bp 


bp 


A 


290 


160 


170 


180 


190 


200 


B 


140 


310 


170 


180 


190 


200 


C 


140 


150 


330 


180 


190 


200 


D 


140 


150 


160 


350 


190 


200 


E 


140 


150 


160 


170 


370 


200 


F 


140 


150 


160 


170 


180 


390 
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Two enzymes can fail to cut in 6 C 2 = 15 ways, these are: AB, 
AC, AD. AE. AF. BC, BD. BE, BF. CD. CE, CF, DE, DF or EF failing to cut. 



failed digests 


bp 


Dp 


bp 


DP 


hn 
Dp 


AB 


450 


1 ~7f\ 

I /u 




1 on 
i yu 




AC 


290 


330 


1 on 
l OU 


1 an 
i yu 


onc\ 


AD 


290 


l bU 




1 an 
i yu 


OC\C\ 
c\j\J 


AC 
MC 




160 


170 


370 


200 


AF 


290 


160 


170 


180 


390 


BC 


140 


480 


180 


190 


200 


BD ' 


140 


310 


350 


190 


200 


BE 


140 


310 


170 


370 


200 


BF 


140 


310 


170 


180 


390 


CD 


140 


150 


510 


190 


200 


CE 


140 


150 


330 


370 


200 


CF 


140 


150 


330 


180 


390 


DE 


140 


150 


160 


540 


200 


DF 


140 


150 


160 


350 


390 


EF 


140 


150 


160 


170 


570 



5 
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Three enzymes can fail to cut in 6 C 3 = 20 ways, these are: 
ABC. ABD. ABE. ABF, ACD, ACE, ACF. ADE, ADF, AEF, BCD, BCE, BCF, 
BDE, BDF, BEF, CDE, CDF, CEF or DEF failing to cut. 



failed digests 


bp 


bp 


bp 


bp 


ABC 


620 


180 


190 


200 


ABD 


450 


350 


190 


200 


ABE 


450 


170 


370 


200 


ABF 


450 


170 


180 


390 


ACD 


290 


510 


190 


200 


ACE 


290 


330 


370 


200 


ACF 


290 


330 


180 


390 


ADE 


290 


160 


540 


200 


ADF 


290 


160 


350 


390 


AEF 


290 


160 


170 


570 


BCD 


140 


660 


190 


200 


BCE 


140 


480 


370 


200 


BCF 


140 


480, 


180 


390 


BDE 


140 


310 


540 


200 


BDF 


140 


310 


350 


390 


BEF 


140 


310 


170 


570 


CDE 


140 


150 


700 


200 


CDF 


140 


150 


510 


390 


CEF 


140 


150 


330 


570 


DEF 


140 


150 


160 


740 



5 
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Four enzymes can fail to cut in 6 C 4 = 15 ways, these are: 
ABCD, ABCE, ABCF. ABDE, ABDF. ABEF, ACDE, ACDF. ACEF, ADEF, 
BCDE. BCDF. BCEF. BDEF or CDEF failing to cut. 



failed digests 


bp 


bp 


bp 


ACCD 


800 


190 


200 


ABCE 


620 


370 


200 


ABCF 


620 


180 


390 


ABDE 


450 


540 


200 


ABDF 


450 


350 


390 


ABEF 


450 


170 


570 


ACDE 


290 


700 


200 


ACDF 


290 


510 


390 


ACEF 


290 


330 


570 


ADEF 


290 


160 


740 


BCDE 


140 


850 


200 


BCDF 


140 


660 


390 


BCEF 


140 


480 


570 


BDEF 


140 


310 


740 


CDEF 


140 


150 


900 



5 
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Five enzymes can fail to cut in 5 C 5 = 6 ways, these are: 
ABCDE, ABCDF.. ABCEF, ABDEF, ACDEF or BCDEF failing to cut. 



failed digests 


bp 


bp 


ABODE 


990 


200 


ABCDF 


800 


390 


ABCEF 


620 


.570 


ABDEF 


450 


740 


ACDEF 


290 


900 


BCDEF 


140 


1050 



All six enzymes can fail to cut in only one way. 



failed digests 


bp 


all 


1190 



Example determination of the entire set of internal control DNA limit 
and partial digestion patterns for a panel of up to three restriction 
io enzymes 

For the example simulation 6, the entire set of internal control 
DNA limit and partial digestion patterns for a panel of up to three restriction 
enzymes can be determined as below. 

1 5 Inter-site fragment sizes (in bp) 



end— A 


A— B 


B~C 


C— end 


25 


30 


35 


40 



Possible digestion products obtained 

All three enzymes can cut in only one way. 

20 



failed digests 


bp 


bp 


bp 


bp 


none 


25 


30 


35 


40 
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One enzyme can fail to cut in 3 Ci = 3 ways, these are: A, B or 
C failing to cut. 



failed digests 


bp 


bp 


bp 


A 


55 


35 


40 


B 


25 


65 


40 


C 


25 


30 


75 



Two enzymes can fail to cut in 3 C 2 = 3 ways, these are: AB, 
AC or BC failing to cut. 



failed digests 


bp 


bp 


AB 


90 


40 


. AC 


55 


75 


BC 


25 


105 



AH three enzymes can fail to cut in only one way. 

io 



failed digests 


bp 


all 


130 
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Example 1a - The digestion internal control plasmid for the 6 bp 
cutter set 1 TRSPA enzymes BamHI, BsrGl, Hindlll, Ncol, Spel, and 
Aflll 

The plasmid pNW33 (shown below) was constructed to 
5 contain an insert with all of the 6 bp cutter TRSPA enzyme sites. 



lo bp fragments. The full sequence for pNW33 is shown below: 

tcgcgcgtttcggtgatgacggtgaaaacctctgacacatgcagctcccggagacggtcacagcttgtctgtaagcggatgcc 
gggagcagacaagcccgtcagggcgcgtcagcgggtgttggcgggtgtcggggctggcttaactatgcggcatcagagca 
gattgtactgagagtgcaccatatgcggtgtgaaataccgcacagatgcgtaaggagaaaataccgcatcaggcgccattc 

15 gccattcaggctgcgcaactgttgggaagggcgatcggtgcgggcctcttcgctattacgccagctggcgaaagggggatgt 
gctgcaaggcgattaagttgggtaacgccagggttttcccagtcacgacgttgtaaaacgacggccagtgaattcgagctcgg 
taccgggccccccctcgaggtcgacggtatcgataagcttgatcgcagctggtaatccggacgcccgcgtcgaagatgttgg 
ggtgttgtaacaatatcgattccaattcagcgggggccacctgatatcctttgtatttaattaaagacttcaagcggtcaactatga 
agaagtgttcgtcttcgtcccagtaaggatccgcactttgaattttgtaatcctgaagggatcgtaaaaacagctcttcttcaaatct 

20 atacattaagacgactcgaaatccacatatcaaatatccgagtgtagtaaacattccaaaaccgtgatggaatggaacaaca 
cttaaatgtacaccctggtaatccgttttagaatccatgataataattttctggattattggtaattttttttgcacgttcaaaattttttgca 
acccctttttggaaacaaacactacggtaggctgcgaaatgttcatactgttgagcaattcacgttcattataagcttttcactgcat 
acgacgattctgtgatttgtattcagcccatatcgtttcatagcttctgccaaccgaacggacatttcgaagtattccgcgtacgtg 




Hindlll (445) 
-h^BspEI (466) 

14C ibp\fiamtf/(B05j 



BspEI sites define the outer ends of the 140 bp and the 200 
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atgttcacctcgatatgtgcatctgtaaaagcaattgtto^ 

agtgttgcgatgctaatgccgttacaaatattccgagcaccaagaatggctgcgcgcttgcctggtacttgacgtcgtatttgacg 

gggtccttgagaaagtatttaaactggaacacaatctgaggaatgatcaaagcaaccaacgccaacgcataataactagtg 

caataccaagacctcccaataatagcacccagacttgtgtaataacctctggctctgatattgctccagatggaattggacgat 

5 atggctcattaattgcgtcgatatctctatcataccagtcgttgattgtctgtgtatagccagtaagacaaggaccagacatcatca 
tgcaaagaatcgcttaagcccttcttggcctttatgaggatctctctgatttttcttgcgtcgagttttccggtaagacctttcggtactt 
cgtccacaaacacaactcctccgcgcaactttttcgcggttgttacttgactggccacgtaatccacgatctctttttccgtcatcgt 
ctttccgtgctccaaaacaacaacggcggcgggtccggattaccagctgcgatcaagcttatcgataccgtcgacctcgacct 
gcaggcatgcaagcttggcgtaatcatggtcatagctgtttcctgtgtgaaattgttatccgctcacaattccacacaacatacga 

10 gccggaagcataaagtgtaaagcctggggtgcctaatgagtgagctaactcacattaattgcgttgcgctcactgcccgctttc 
cagtcgggaaacctgtcgtgccagctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttc 
cgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggtt 
atccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaag 
gccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaa 

15 acccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccg 
gatacctgtccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcg 
ctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaaccc 
ggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacag 
agttcttgaagtggtggcctaactacggctacactagaaggacagtatttggtatctgcgctctgctgaagccagttaccttcgga 

20 aaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgc 
agaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaagggatt 
ttggtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatatatgagt 
aaacttggtctgacagttaccaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcctgact 
ccccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctca 

25 ccggctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaagtggtcctgcaactttatccgcctcc 
atccagtctattaattgttgccgggaagctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgccattgctacaggc 
atcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcgagttacatgatcccccatgttgt 
gcaaaaaagcggttagctccttcggtcctccgatcgttgtcagaagtaagttggccgcagtgttatcactcatggttatggcagc 
actgcataattctcttactgtcatgccatccgtaagatgctmctgtgactggtgagtactcaaccaagtcattctgagaafagtgta 

30 tgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacatagcagaactttaaaagtgctcatcatt 
ggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctgttgagatccagttcgatgtaacccactcgtgcacccaac 
tgatcttcagcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaaatgccgcaaaaaagggaataag 
ggcgacacggaaatgttgaatactcatactcttcctttttcaatattattgaagcatttatcagggttattgtctcatgagcggataca 
tatttgaatgtatttagaaaaataaacaaataggggttccgcgcacatttccccgaaaagtgccacctgacgtctaagaaacc 

35 attattatcatgacattaacctataaaaataggcgtatcacgaggccctttcgtc 
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The insert was introduced as a four fragment ligation of 140 
(Kpnl-BamHI), 150/ 160 (BamHI-Hindlll), 170/ 180 (Hindlll-Spel) and 
190 / 200 (Spel-Xhol) into Kpnl / Sail digested pUCl9c DNA (Genbank 
X02514). The Sail and Xhol sites were lost as a result of the joining of their 
5 compatible sticky ends. The insert can be removed with BspEI or Pvull. 

The insert region of 1 190 bp and the short flanking regions to 
the vector junctions were sequenced twice in each direction in order to 
establish the plasmid sequence. In addition, a total of 63 analytical 
restriction digests and one minus-enzyme control were performed as 
to detailed in the following table: 







Expected band sizes 




Digest 


Well 


bp 


Dp 


Dp 


Dp 


Dp 


Dp 


bp 


Uncut 




1190 














BamHI 


2 


140 


1050 












BsrGI 


3 


290 


900 












Hindlli 


4 


450 


740 












Ncol ■.. ... 


5 


620 


570 : 












Spel 


6 


800 


390 












Aflli 


7 


990 


200 












damHI BsrGI 


8 


140 


150 


900 










BamHI Hindlli 


9 


140 


310 


740 










BamHI Ncol 


10 


140 


480 


570 










BamHI Spel 


11 


140 


660 


390 










BamHI Atlll 


12 


140 


, 850 


200 










BsrGI Hindlli 


13 


290 


■ 160 


740 










BsrGI Ncol 


14 


290 


330 


570 










BsrGI Spel 


15 


290 


510 


390 










BsrGI Aflli 


16 


290 


700 


200 










Hindlli Ncol 


17 


450 


170 


570 










Hindlli Spel 


18 


450 


350 


390 










Hindlli Atlll 


19 


450 


540 


200 










Ncol Spel 


20 


620 


180 


390 










Ncol Mil 


21 


620 


370 


200 , 










Spel Aim 


22 


800 


190 


200 










BamHI BsrGI Hindlli 


23 


140 


150 


160 


740. 








BamHI BsrGI Ncol 


24 


140 


150 


330 


570 








BamHI BsrGI Spel 


25 


140 


150 


510 


390 








BamHI BsrGI Aflli 


26 


140 


150 


700 


200 
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Qami-il 1-tinriHl \tml 

OafTirit ninuin i\lui 


27 j 140 


310 


170 


570 








Eg ml— It hJtnrtlll Cna/ 


28 


140 


310 


350 


390 










29 


140 


310 


540 


200 








aamnf ncoi jpet 


30 


140 


480 


1 80 


390 








03mHi NCOI Aflll 


31 


140 


480 


370 


200 








aamrii opei Atiti 


32 


140 


DDU 


1 90 


200 








BsrGI Hindlll Ncol 


33 


290 


1 fin 

1 DU 


1 7fl 


Of u 








BsrGI Hindlll Spel 


34 


290 


1 en 


tea 










BsrGt Hindlll Aflli 


35 


290 


1 ou 


Of U 










BsrGI Ncol Spel 


36 


290 




1 or* 


oyu 








BsrGI Ncol Ami 


37 


290 




u/ U 


oc\c\ 

CUW.: 








BsrGI Spel Aflll 


38 


290 


510 


i yu 


^UU 








Hindlll Ncol Spel 


39 . 


450 


170 


1 Qf\ 

l OU 


ion 








Hindlll Ncol Aflll 


40 


450 


170 


370 


200 








Hindlll Spel Aflll 


41 


450 


350 


190 


200 








Ncol Spel Aflll 


42 


620 


180 


190 


200 








BamHI BsrGI Hindlll Ncol 


43 


140 


150 


1 en 
1 OU 


A ~7f\ 

1 f\j 








BamHl BsrGI Hindlll Spel 


44 


140 


150 




ooU 








BamHI BsrGI Hindlll Atlll 


45 


140 


150 


1 60 


540 








BamHI BsrGI Ncol Spel 


46 


140 


150 


330 


180 


390 






BamHI BsrGI Ncol Aflll 


47 


140 


150 


330 


370 


200 






BamHI BsrGI Spel Aflll 


48 


140 


150 


510 


190 


200 






BamHI Hindlll Ncol Spel 


49 


140 


310 


170 


1 80 ;. 








BamHI Hindlll Ncol Aflll 


50 


140 


310 


170 


370 


200 






BamHI Hindlll Spel Aflll 


51 


140 


310 


350 


1 90 








BamHI Ncol Spel Aflll 


52 


140 


480 


180 


190 


200 






BsrGI Hindlll Ncol Spel 


53 


290 


160 


170 


180 1 


390 






BsrGI Hindlll Ncol Aflll 


54 


290 


160 


1 70 










BsrGI Hindlll Spel Aflll 


55 


290 


160 


350 


190 








RcrCZl Nmi Qrws/ Aflll 

OOI\JI /VOL?/ >JfJCl Mllll 


56 


290 


330 


180 


190 


200 






Hindlll Ncol Spel Atlll 


57 


450 


170 


180 


190 


200 






BamHI BsrGI Hindlll Ncol Spel 


58 


140 


150 


160 


170 


180 


390 




BamHI BsrGI Hindlll Ncol Aflll 


59 


140 


150 


160 


170 


370 


200 




BamHI BsrGI Hindlll Spel Aflll 


60 


140 


150 


160 


350 


190 


200 




BamHI BsrGI Ncol Spel Aflll 


61 


140 


150 


330 


180 


190 






BamHI Hindlll Ncol Spel Aflll 


62 


140 


. 310 


170 


180 


190 


200 




BsrGI Hindlll Ncol Spel Aflll 


63 


290 


160 


170 


180 


190 


200 




BamHI BsrGI Hindi 11 Ncol Spel Aflll 


64 


140 


150 


160 


170 


180 


190 


200 



All of these digests produced the expected fragment patterns 
on agarose gel electrophoresis and a number of these, shaded in the table 
above, are shown in figure 1 . The restriction digests illustrated in figure 1 
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were carried out using the following conditions: 
Digest 1 (Minus enzyme control) 



10 jil 


100 Mg/ml internal control plasmid DNA in 1x NEB buffer #2+100 Mg/ml BSA 


90 j 


1x NEB buffer #2 + 100 ug/ml BSA 


Digest 5 


15 Hi I 


0.33 U/ui Ncof in 1 x NEB buffer #2 + 100 Mg/ml BSA 


10 ill 


100 ug/ml internal control plasmid DNA in 1x NEB buffer #2 + 100 Mg/ml BSA 


75 nl 


1x NEB buffer #2 + 100 Mg/ml BSA 


Digest 9 


15 jil 


0.33 U/liI BamHI in 1x NEB buffer #2 + 100 Mg/ml BSA 


15 ill 


0.33 U/fil Hindlli \n 1x NEB buffer #2 + 100 Mg/ml BSA 


10^1 


100 Mg/ml internal control plasmid DNA in 1x NEB buffer #2+100 Mg/ml BSA 


60 |il 


1x NEB buffer #2+100 Mg/ml BSA 


Digest 37 


15 Ml 


0.33 U/jjI BsrGi in 1x NEB buffer #2 + 100 Mg/ml BSA 


15 Ml 


0.33 U/mI A/co/ in 1x NEB buffer #2+100 Mg/ml BSA 


15 Ml 


0.33 U/mI Aflll in 1x NEB buffer #2 + 100 Mg/ml BSA 


10 Ml 


100 Mg/ml internal control plasmid DNA in 1x NEB buffer #2+100 Mg/ml BSA 


45 M» 


1x NEB buffer #2 + 100 Mg/ml BSA 



10 



15 
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Digest 49 



15 ul 


0.33 U/ul BamHI 'm 1x NEB buffer #2 + 100 ug/mi BSA 


15 ul 


0.33 U/ul Hindlll in 1 x NEB buffer #2 + 100 ug/ml BSA 


15 ul 


0.33 U/ul A/co/in 1x NEB buffer #2 + 100 ug/ml BSA 


15 ul 


0.33 U/ul Spel in 1 x NEB buffer #2 + 100 ug/ml BSA 


10 ul 


100 ug/ml internal control plasmid DNA in 1x NEB buffer #2 + 100 ug/ml BSA 


30 ul 


1 x NEB buffer #2 + 1 00 ug/ml BSA 


Digest 61 


15 ul 


0.33 U/ul BamHl\n 1x NEB buffer #2+100 ug/ml BSA 


15 ul 


0.33 U/ul BsrGI in 1x NEB buffer #2 + 100 ug/ml BSA 


15 ul 


0.33 U/ul A/co/in 1x NEB buffer #2 + 100 ug/ml BSA 


15 ul 


0.33 U/u! Spel in 1x NEB buffer #2 + 100 ug/ml BSA 


15 ul 


0.33 U/ul Afilf in 1 x NEB buffer #2+100 ug/ml BSA 


10 ul 


100 ug/ml internal control plasmid DNA in 1x NEB buffer #2 + 100 ug/ml BSA 


15 ul 


1x NEB buffer #2 + 100 ug/ml BSA 



All 64 restriction digests were incubated at 37°C for 6 hours 
and samples were then electrophoresed on 2.5 % FMC MetaPhor agarose 
gels as illustrated in figure 1 . The lanes marked M contain mixed samples 
io from ail of the digests 1-64 and these lanes were used as size markers 
after confirmation of the fragment sizes against Stratagene Kb ladder 
markers. 

When spiked into a genomic DNA digest the internal control 
restriction fragment pattern produced is indicative of both the degree of 
is digestion and the nature of any partial restriction at less than limit digestion. 
It is possible to deduce from the bands present which, if any, of the 
enzymes have failed to cut and therefore to take action to correct this 
before the DNA is used in a subsequent analysis or enrichment procedure. 



20 
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Preparation of SspEZ-released internal control DNA on a large scale 

The method used lor generating internal control PCR product 
DNA (free from contaminating dNTPs) is depicted in the following figure: 



pNW33 
DNA 



1 



crude bio + dNTP 

PCP product | 010 

SA-caotured 0-SA-bio___ + dNTP 

POP product j 0'0-SA-# 

clean fr-SA-bi o ^ A 

PCF. product 8>pE| t »o.o-*A^ 

digest 

internal _ 

control for WE digests 



Primers and PCR 

20uMBIGi40UP 

10 5' biotin-CGCAGCTGGTAATCCGGACGCCCGCGTCGAAGATGTT 3' 
20 uM B1O200DOWN 

5" biotin-CGCAGCTGGTAATCCGGACCCGCCGCCGTTGTTGTT 3' 

15 Bulk PCR amplification (192x 100 nl reactions) was carried 

out according to the conditions described below: 





per 20 ml 


final conditions 


100 mM dATP 


20 ul 


100 uM 


100 mM dCTP 


20 ul 


100 uM 


100 mM dGTP 


20 ul 


100 uM 


100 mM dTTP 


20 ul 


100 uM 


10x PCR buffer 


2 mi 


1x 


25 mM MgCI 2 


400 ul 


1/50th volume 


1 nq/ul pNW33 


400 ui 


2 ng per PCR 


20 uM BIO140UP 


200 ul 


20 pmol per PCR 


20 uM BIO200DOWN 


200 ul 


20 pmol per PCR 


Taq DNA polymerase 


100 ul 


2.5 U per PCR 
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The master mix was rapidly dispensed into PGR tubes. 
Thermal cycling was initiated using the following parameters: 30 cycles of 
97 C C for 1 min, 50 C C for 2 min, and 72 C C for 3 min; 72 C C for 5 min; and 
5 then 4°C. 

After PGR amplification, samples were pooled and subjected 
to capture of biotinylated PCR product termini and BspEI release of internal 
control DNA. 

io Capture of biotinylated PCR product termini and BspEI release of 
internal control DNA 

All separations were carried out using a Dynal MPC-1 
separator (Dynal, product #12001). 

20 ml of pooled PCR reaction were mixed with 20 ml of 
15 Dynabeads M-280 in 20 mM tris-HCI (pH 7.4), 2 mM EDTA (pH 8.0), 2 M 
NaCI. The tube was incubated at room temperature for 1 hour with rolling. 

The Dynabeads were then washed four times with 20 ml of 10 
mM tris-HCI (pH 7.4), 1 mM EDTA (pH 8.0), 1 M NaCI. 

BspEI digestion was carried out for 1 hour at 37°C in 2 ml of 
:o 0.5 U/|il BspEI in 1x NEB buffer #3. The digest was then ethanol 

precipitated by the addition of 0.1 volume of 3 M sodium acetate (pH 5.2) 
and 2.5 volumes of ethanol, chilling to -20°C and then centrifugation. 
Pellets were rinsed with 70 % ethanol prior to redissolution in 500 )il of 1x 
TE buffer. 

25 1 |il of the BspEI-released internal control DNA was mixed 

with 10 (il of 50 % glycerol AGE loading dye and electrophoresed on a 1.5 
% agarose gel to confirm that the size of the purified DNA was in 
accordance with that expected. 



30 
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High molecular weight genomic DNA digestion in the presence of 
internal control DNA with a dilution series of a mixture of the six set 1 
6 bp cutters 

High molecular weight canine genomic DNA was first mixed 
5 with a dilution series of a mixture of the six set 1 6 bp cutters - each at the 
same number of units, Aliquots were then removed and mixed with the 
BspEI-released internal control DNA described above. 20 \xg of canine 
genomic DNA was digested with 0.25, 0.025, 0.0025, 0.00025, 0.000025, 
0.0000025 and 0 U/ul BamH1 / BsrGI / Hindlll / Ncol / Spel / Af III in 200 (xl. 
io 0.4 jag of canine genomic DNA and 1 jal of BspEI-released internal control 
DNA were digested with 0.25, 0.025, 0.0025, 0.00025, 0.000025, 
0.0000025 and 0 U/jjlI BamH1 / BsrGI / Hindlll / Ncol / Spel / Aflll in 4 \il 
Vistra Green staining was used to monitor the extent of internal control 
DNA digestion and high molecular weight DNA digestion. The fragment 
1 5 pattern from the internal control DNA is diagnostic of both the degree of 
digestion and the nature of any partial restriction at less than limit digestion. 

A premix of restriction enzymes, buffer and BSA was 
prepared as detailed below: 



component 




100 \Ji\i\BamHl 


1 nl 


10 U/|il BsrGI 


10 ill 


40 U/nl Hindlll 


2.5 nl 


50 U/|il Ncol 


2 Hi 


50 U/nl Spel 


2 Hi 


20 U/nl Aflll 


5 Hi 


1 0x NEB buffer #2 


4 H> 


100x BSA 


0.4 Hi 


water 


13. 1 Hi 



20 

Total volume = 40 fil 

The premix therefore contained each restriction enzyme at 
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2.5 U/jliI in 1 x NEB buffer #2 and 1 x BSA. Serial 1 0-fold dilutions of this 
premix were then prepared in 1x NEB buffer #2 and 1x BSA. 

Premixes of canine genomic DNA, buffer and BSA were also 
prepared as detailed below: 



Component 




1 mg/ml canine genomic DNA 


20 ul 


10x NEB buffer #2 


3 ul 


100x BSA 


0.8 ul 


Water 


51. 2 |il 



These premixes therefore contain canine genomic DNA at 
0.25 mg/ml in 1 x NEB buffer #2 and 1 x BSA. 

Canine genomic DNA and restriction enzyme mixes were 
io then set up as follows: 



tube 


component #1 


component #2 


1 


80 ul of canine genomic DNA at 0.25 
mg/ml in 1x NEB buffer #2 and 1x BSA 


20 ui of each restriction enzyme at 2.5 
U/ul in 1 x NEB buffer #2 and 1 x BSA 


2 


80 ul of canine genomic DNA at 0.25 
mg/ml in 1x NEB buffer #2 and 1x BSA 


20 pi of each restriction enzyme at 0.25 
U/ul in 1 x NEB buffer #2 and 1 x BSA 


3 


80 ul of canine genomic DNA at 0.25 
mg/ml in 1x NEB buffer #2 and 1x BSA 


20 ul of each restriction enzyme at 0.025 
U/ul in 1x NEB buffer #2 and 1x BSA 


4 


80 ul of canine genomic DNA at 0.25 
mg/ml in 1x NEB buffer #2 and 1x BSA 


20 ui of each restriction enzyme at 
0.0025 U/^il in 1x NEB buffer #2 and 1x 
BSA 


5 


80 ul of canine genomic DNA at 0.25 
mg/ml in 1x NEB buffer #2 and 1x BSA 


20 ul of each restriction enzyme at 
0.00025 U/ul in 1x NEB buffer #2 and 1x 
BSA 


6 


80 ul of canine genomic DNA at 0.25 
mg/ml in 1x NEB buffer #2 and 1x BSA 


20 ul of each restriction enzyme at 
0.000025 U/ul in 1x NEB buffer #2 and 
1x BSA 


7 


80 ui of canine genomic DNA at 0.25 
mg/ml in 1x NEB buffer #2 and 1x BSA 


20 ul oMx NEB buffer #2 and 1x BSA 
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H) 



15 



2 ul aiiquots were then removed from tubes 1 -7 and added to 

1 ul of BspEI-released internal control DNA and 1 ul of 2x NEB buffer #2 
and 2x BSA to give samples 1-7ic. Samples 1-7ic were then overlaid with 
50 \i\ of mineral oil in order to prevent evaporation. 

The remaining volume from tubes 1-7 was then added to 
100 Ml of 1 x NEB buffer #2 and 1x BSA. 

All samples were finally incubated at 37°C overnight. 

After digestion, 20 ul of digests 1-7 were mixed with 10 ul of 
50 % glycerol AGE loading dye and 4 ul of digests 1-7ic were mixed with 

2 ul of 50 % glycerol AGE loading dye. Digests in loading dye were then 
electrophoresed on a 2.5 % MetaPhor agarose gel in 1x TBE. The gel was 
stained for 60 min in 500 ml of 1x TBE containing 50 ul of Vistra Green. 
The stained gel was finally imaged on a Fiuorimager with the following 
settings: a 488 nm laser; a 570 DF 30 filter; a PMT setting of 700 V; 

200 urn resolution; and low sensitivity. 



Restriction digests - final conditions 

Restriction digests 1-7 were therefore performed under the 

20 following conditions: 



tube 


M9 
canine 
genomic 
DNA 


U 

BamH1 


u 

BsrGI 


U 

Hindlll 


U 
Ncol 


U 
Spel 


U 

Afllf 


NEB 
butter #2 


BSA 


1 


20 


50 


50 


50 


50 


50 


50 


1X 


100 


2 


20 


5 


5 


5 


5 ■ 


5 


5 


1x 


100 


3 


20 


0.5 


0.5 


0.5 


0.5 


0,5 


0.5 


1x 


100 


4 


20 


0.05 


0.05 


0.05 


0.05 


0.05 


0.05 


IX 


100 


5 


20 


0.005 


0.005 


0.005 


0.005 


0.005 


0.005 


1X 


100 


6 


20 


0.0005 


0.0005 


0.0005 


0.0005 


0.0005 


0.0005 


1x 


100 


7 


20 


0 


0 


0 


0 


0 


0 


1x 


100 



Total volume = 200 jj-l 
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Likewise, restriction digests 1-7ic were performed under the 
following conditions: 



tube 


Mg 


Ml 


U 


u 


u 


u 


u 


u 


NEB 


jig/ml 




canine 


IC 


BamH1 


BsrCI 


Hindltl 


Ncol 


Spel 


Aflll 


buffer 


BSA 




genomic 


DNA 














#2 






DNA 




















lie 


0.4 


1 


1 


1 


1 


1 


1 


1 


1x 


100 


2ic 


0.4 


1 


0.1 


0.1 


0.1 


0.1 


0.1 


0.1 


1x 


100 


3ic 


0.4 


1 


0.01 


0.01 


0.01 


0.01 


0.01 


0.01 


1x 


100 


4ic 


. 0.4 


i 


0.001 


0.001 


0.001 


0.001 


.0.001 


0.001 


1x 


100 


Sic 


0.4 


1 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


1x 


100 


6ic 


0.4 




0.00001 


0.00001 


0.00001 


0.00001 


0.00001 


0.00001 


1X 


100 


7ic 


0.4 




0 


0 ■ 


0 


0 


0 


0 


IX 


100 



IC = BspEI-released internal control DNA 

Total volume = 4 jal 

The results are shown in figure 2. 
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Example 1b - The digestion internal control plasmid for the 4 bp 
cutter set 1 TRSPA enzymes Haelll, Mboi, and Msel 

The plasmid pNW35 (shown below) was constructed to 
contain an insert with all of the 4 bp cutter TRSPA enzyme sites. 



Hindlll (52) 




Hindlll and EcoRI sites define the outer ends of the 25 bp and 
the 40 bp fragments. The sequence of pNW35 is shown below with the 
inserted region shown in bold type: 

10 atgaccatgattacgccaagctctaatacgactcactatagggaaagcttccggacgtctcaggctaatgttggcccacc 
gacgttccacgatggggcgctcttaagggcttagaccctcgtcgggagtalttctgtgatctggcgacactcacgcg 
agaagtcattaccggcgatatgaattcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaactta 
atcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcg 
cagcctgaatggcgaatgggaaattgtaaacgttaatattttgttaaaattcgcgttaaatttttgttaaatcagctcattttttaacca 

15 ataggccgaaatcggcaaaatcccttataaatcaaaagaatagaccgagatagggttgagtgttgttccagtttggaacaag 
agtccactattaaagaacgtggactccaacgtcaaagggcgaaaaaccgtctatcagggcgatggcccactacgtgaacc 
atcaccctaatcaagtttttlggggtcgaggtgccgtaaagcactaaatcggaaccctaaagggagcccccgatttagagcttg 
acggggaaagccggcgaacgtggcgagaaaggaagggaagaaagcgaaaggagcgggcgctagggcgctggcaa 
gtgtagcggtcacgctgcgcgtaaccaccacacccgccgcgcttaatgcgccgctacagggcgcgtcaggtggcacttttcg 

20 gggaaatgtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaat 
gcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcggcattttgccttcctgttt 
ttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaactggatctc 
aacagcggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggcgcggt 
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attatcccgtattgacgccgggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtc 
acagaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggcc 
aacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgccttgat 
cgttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgatgcctgtagcaatggcaacaacgtt 

5 gcgcaaactattaactggcgaactacttactctagcttcccggcaacaattaatagactggatggaggcggataaagttgcag 
gaccacttctgcgctcggcccttccggctggctggtttattgctgataaatctggagccggtgagcgtgggtctcgcggtatcatt 
gcagcactggggccagatggtaagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatggatgaacga 
aatagacagatcgctgagataggtgcctcactgattaagcattggtaactgtcagaccaagtttactcatatatactttagattgat 
ttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttcc 

l() actgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaa 
aaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagagcg 
cagataccaaatactgtccttctagtgtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctc 
tgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccggataa 
ggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgagatac 

15 ctacagcgtgagctatgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcg 
gaacaggagagcgcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgactt 
gagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcctggcc 
tmgctggccttttgctcacatgttctttcctgcgttatcccctgattctgtggataaccgtattaccgccmgagtga'gctgataccgc 
tcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctc 

20 tccccgcgcgttggccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcgcaacgcaat 
taatgtgagttagctcactcattaggcaccccaggctttacactttatgcttccggctcgtatgttgtgtggaattgtgagcggataa 

caatttcacacaggaaacagct 

The 4 bp internal control plasmid for restriction enzymes 
25 Haelll, Mbol, and Msell was prepared by the insertion of a synthetic 130 bp 
fragment into Hindlll / EcoRI-digested pMOSblue (Amersham Pharmacia 
Biotech). 

The insert region of 130 bp was sequenced twice in each 
direction in order to establish the plasmid sequence. The presence of the 
30 restriction sites and the mobility of the fragments released were also 
checked by restriction digestion and polyacrylamide gel electrophoresis. 

Preparation of the internal control spike DNA from the plasmid 

The following illustration describes the strategy used for the 
35 preparation of EcoRI-released internal control DNA: 
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pNW35 
DNA 



crude bio , , MTn 

n n j 4 + d NTP 

PlR product | 

SA-captured fr-SA-bio + dNTp 

PCR product . 



clean 0-SA-bio- 
PCR product 



RE digest 

4 bp cutter internal - 
control for RE digests 



'i 



Primers and PCR 

5 U-1 9 mer bio primer 5' bio-GTTTTCCCAGTCACGACGT 3* 
ICPCR(F) primer 5' TCCGGACGTCTCAGGCTAATGTT 3' 

Bulk PCR amplification (96x 100 jnl reactions, repeated to 
io give 192 reactions in total) was carried out according to the conditions 
described below (all volumes are in \xl): 





xl (iaI) 


X100 &J) 


Final condition^ 


Water 


84.4 


8440 




lOx PCR buffer 


10 


1000 


1x 


25 mM dNTP mix 


0.8 


80 


200 \M 


20 u.M 17 promoter-bio primer 


1 


100 


20 pmol per PCR 


20 uM U19 mer-bio primer 


1 


100 


20 pmol per PCR 


1 ng/uJ pNW35 


2 


200 


2 ng per PCR 


5 U/uJ Taq polymerase 


0.8 


80 


4U per PCR 


X-:-. Total** 




10000^.?^; : 
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The PCR master mix. was rapidly dispensed into 96 PGR 
tubes. Thermal cycling was initiated using the following parameters: 94°C 
for 2 min; 50 C C for 2 min; 29 cycles of 72°C for 2 min, 94°C for 45 sec, and 
50°C for 1 min; 72 C C for 8 min; and then 4°C. 

5 

Capture of biotinylated PCR product termini and EcoRI- release of 
internal control DNA 

All separations were carried out using a Dynal MPC-1 

separator (Dynal, product #12001). 
io 20 ml of pooled PCR reaction were mixed with 20 ml of 

Dynabeads M-280 in 20 mM tris-HCI (pH 7.4), 2 mM EDTA (pH 8.0), 2 M 

NaCI. The tube was incubated at room temperature for 1 hour with mixing 

on a Denley Spiramix 5. 

The Dynabeads were then washed four times with 20 ml of 10 
15 mM tris-HCI (pH 7.4), 1 mM EDTA (pH 8.0), 1 M NaCI. A fifth wash was 

performed in 20 ml of 1x buffer M. 

EcoRi-digestion was carried out for 1 hour at 37°C in 5 ml of 

0.25 U/ul EcoRI in 1x buffer M. The digest was then divided into ten 500 

ul aliquot parts. Each aliquot was ethanol precipitated by the addition of 1 
:o |LtI of See DNA, 0:1 volume of 3 M sodium acetate (pH 5.2), and 2.5 

volumes of ethanol. The precipitations were mixed and chilled to 0°C on 

ice for 30 minutes and then centrifugation at 20,000 maxRCF for 10 

minutes. The pellets were rinsed with 70 % ethanol before dissolving in a 

total of 500 Hi of 1xTE buffer. 
25 A 1 |jl aliquot of the EcoRI-released internal control DNA was 

electrophoresed on an 8 % polyacrylamide gel to confirm that the size of 

the purified DNA was in accordance with that expected. 
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High molecular weight genomic DNA digestion in the presence of the 
4 bp cutter internal control DNA with a dilution series of the enzymes 
Haelll, Msel, and Mbol 

High molecular weight human placental DNA was mixed with 
5 a dilution series of a mixture of the three set 1 4 bp cutter restriction 
endonucleases - each at the same number of units. Aliquots were then 
removed and mixed with the EcoRI-released internal control DNA. 3.6 jag 
of Human placental DNA (Sigma) was digested with 0.5 U/|al, 0.1 U/jil, 0.02 
U/ul, 0.004 U/|iK and 0 U/ul each of Haelll, Mbol, and Msel in a total 
in volume of 36 ul. 0.4 ug of placental DNA and 1 pi of EcoRI-released 

internal control DNA were digested with 0.5 U/jil, 0.1 U/jal, 0.02 U/|il, 0.004 
U/|il, and 0 U/ul each of Haelll, Mbol, and Msel in a total volume of 4 )llI. 
Vistra Green (Amersham Pharmacia Biotech) staining was used to monitor 
the extent of internal control DNA digestion and high molecular weight DNA 
15 digestion. 

A premix of restriction enzymes (5 U/|il each enzyme), buffer, 
and BSA was prepared as described below: 



Component 




50 U/|il Haelll 


1 


20 U/^il Msel 


2.5 


25 U/(il Mbol 


2 


10x NEB buffer #2 


1 


10x BSA 


1 


Water 


2.5 


Total 


10 



:o Serial 5-fold dilutions of the premix were prepared in 1x NEB 

buffer #2 and 1xBSA. 

A 6x premix of human placental DNA, buffer, and BSA were 
also prepared as described below; 
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j Component 


6x mix (u.1) 


per reaction (uJ) 


i img/ml human placental DNA 


24 


! 4 


i 10x NEB buffer #2 

i 


9.6 


! 1 - 6 


i 10xBSA 
i 


9.6 


; 1.6 


j Water 


52.8 


j 8.8 



This premix contained placental DNA at 0.25 mg/ml in 1x 
NEB buffer #2 and 1x BSA. 

Human placental DNA and restriction mixes were then set up 

5 as follows: 



Tube 


Component #1 


Component #2 


1 


16 ul of placental DNA at 0.25 mg/ml in 
1x NEB buffer #2 and 1xBSA 


4 ul of each restriction enzyme at 5 U/ul 
in 1x NEB buffer #2 and 1x BSA 


2 


16 ul of placental DNA at 0.25 mg/ml in 
1x NEB buffer #2 and 1x BSA 


4 ul of each restriction enzyme at 1 U/ul 
in 1x NEB buffer #2 and 1x BSA 


3 


16 ui of placental DNA at 0.25 mg/ml in 
1x NEB buffer #2 and 1x BSA 


4 ul of each restriction enzyme at 0.2 
U/ul in 1x NEB buffer #2 and 1x BSA 


4 


16 ul of placental DNA at 0.25 mg/mi in 
1x NEB buffer #2 and 1x BSA 


4 ul of each restriction enzyme at 0.04 
U/ul in 1x NEB buffer #2 and 1x BSA 


5 


16 juLt of placental DNA at 0.25 mg/ml in 
1x NEB buffer #2 and 1x BSA 


4 pi of 1x NEB buffer #2 and 1x BSA 



2 \x\ aliquots were then removed from tubes 1-5 and added to 
1 |il of EcoRI-released internal control DNA and 1 jil of 2x NEB buffer #2 
io and 2x BSA to give samples 1 -Sic. Samples 1 -Sic were then overlaid with 
50 |il of mineral oil in order to prevent evaporation. 

The remaining volume from tubes 1-5 was then added to 
18 til of 1x NEB buffer #2 and 1x BSA. 

All samples were finally incubated at 37°C overnight. 
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Restriction digests - final conditions 

Restriction digests 1-5 were therefore performed under the 
following conditions: 



tube 


|jg human 


U 


U 


U 


NEB 


ug/ml 




placental 


Haelll 


Msef 


Mbol 


buffer #2 


BSA 




DNA 












1 


3.6 


20 


20 


20 


1x 


100 


2 


3.6 


4 


4 


4 


1x 


100 


3 


3.6 


0.8 


0.8 


0.8 


1X 


100 


4 


3.6 


0.16 


0.16 


0.16 


1x 


100 


5 


3.6 


0 


0 


0 


1x 


100 



Total volume = 36 ui 

Likewise, restriction digests 1 -Sic were performed under the 
following conditions: 



tube 


ug human 


Pi 


U 


u 


u 


NEB 


Ug/ml 




placental 


IC 


Haelll 


Msel 


Mbol 


buffer #2 


BSA 




DNA 


DNA 












1 


0.4 




2 


2 


2 


1x 


100 


2 


0.4 




0.4 


0.4 


0.4 


1x 


100 


3 


0.4 




0.08 


0.08 


0.08 


1x 


100 


4 


0.4 




0.016 


0.016 


0.016 


1x 


100 


5 


0.4 




0 


0 


0 


1x 


100 



IC = £cofl/-released internal control DNA 
Total volume = 4 jil 

I5 

10 |il of digests 1-5 were each mixed with 3 ul of loading dye 
and 4 jil of digests 1 -Sic were mixed with 1 of loading dye. To sample 
number Sic, 180 ng of PCR molecular weight markers were added to serve 
as size standards. The band sizes for these markers are 50 bp, 100 bp, 
20 200 bp, 300 bp, 400 bp, 500 bp, 525 bp, 700 bp, and 1000 bp. The digests 
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were then electrophoresed on an 8 % polyacrylamide gel in 1x TBE as 
described below: 

10 ml of 10x TBE 
5 1 9.5 ml of 40 % acrylamide 

1 9 ml of 2 % methylene bisacrylamide 
50.4 ml of water 

1 ml of freshly prepared 1 0 % (w/v) ammonium persulphate 
100 ul of TEMED 

io 

A Cambridge Electrophoresis Ltd. vertical protein 
electrophoresis unit was used with 1 mm plate spacing. The samples were 
electrophoresed at 30 mA for 2 hours in 1x TBE. The gel was then stained 
for 30 minutes in 500 ml of 1x TBE containing 50 ul of Vistra Green. The 
is stained gel was imaged on a Fluorimager with the following settings: a 488 
nm laser; a 570 DF 30 filter; a PMT setting of 700 V; 200 um resolution; 
and low sensitivity. 

The results are shown in figure 3. 

20 Example 2 

TRSPA-2 analysis with the pNW33 BamHI, Hindlll, and Aflll matrix and 
the pNW33 Hindlll, Ncol, and Spel matrix probed with the 140 bp 
BspEIXo BamHI fragment from pNW33 

25 As an example of the case where the probe hybridizes to an 

arrayed PCR fragment with a different restriction site at each end, the 
pNW33 BamHI, Hindlll, and Aflll matrix (matrix #7) was probed with a PCR 
product from the 140 bp BspEI (466) to BamHI (605) restriction fragment 
within pNW33. In this example, the probe binds to a 204 bp Hindlll to 

30 BamHI PCR product derived from the 160 bp Hindlll (445) to BamHI (605) 
restriction fragment within pNW33 (see below). 
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pNW33 (3S54 bp) 
(not drawn to scale) 



vndltt (446) 
-^BsoEf (466) 

150 b'p\^ BsrGf (755) 
160 b P V Hindi '// f915] 

170bp >;vco/ noes? 

180 bp \ 

190 bpj^ 

200 bp / 

yk 3sp Ei (1655} 

7- Hindftl (1686) 



■ A f Iff (1455) 



\HinaW(l715) 



As an example of the case where the probe hybridizes to an 
5 arrayed PCR fragment with the same restriction site at each end, the 

pNW33 Hindlll, Ncol, and Spei matrix (matrix #17) was probed with a PCR 
product from the 140 bp BspEI (466) to BamHI (605) restriction fragment 
within pNW33. In this example, the probe binds to a 514 bp Hindlll to 
Hindlll PCR product derived from the 470 bp Hindlll (445) to Hindlll (915) 
io restriction fragment within pNW33 (see above). 

Oligonucleotides 



BamHI short PCR primer 5' TGTAACGACACATTGCTGGATACC 3' 

15 Hindlll short PCR primer 5' ATATAACTCTCGCTCCTTGATAAC 3' 

" Ncol short PCR primer 5' AGGCGTCTGAGGCTGCGGCTATGG 3' 

Spel short PCR primer 5' AACCCGTCGCGACGAGAGTCTAAG 3' 

Aflll short PCR primer 5* GATATACGTGATATATTTTGATTG 3' 



20 BamHI adaptor 5' pGATCGGTATCCAGCAATGTGTCGTTACA 3' 

Hindlll adaptor 5' pAGCTGTTATCAAGGAGCGAGAGTTATAT 3' 

Ncol adaptor 5* pCATGCCATAGCCGCAGCCTCAGACGCCT 3' 

Spel adaptor 5' pCTAGCTTAGACTCTCGTCGCGACGGGTT 3' 
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Aflll adaptor 



5' pTTAACAATCAAAATATATCACGTATATC 3' 



BamHI long PCR primer 5' TGTAACGACACATTGCTGGATACCGATCC 3' 

Hindlll long PCR primer 5' ATATAACTCTCGCTCCTTGATAACAGCTT 3' 

5 Ncol long PCR primer 5' AGGCGTCTGAGGCTGCGGCTATGGCATGG 3' 

Spel long PCR primer 5' AACCCGTCGCGACGAGAGTCTAAGCTAGT 3' 

Aflll long PCR primer 5' GATATACGTGATATATTTTGATTGTTAAG 3' 

Luc140down primer 
io 5' GCGCTAGGGATCCTTACTGGGACGAAGACGAA 3' 

Lud40up-bio primer 

5' biotin-CGCAGCTGGTAATCCGGACGCCCGCGTCGAAGATGTT3' 

1 5 Restriction digestion of pNW33 

Restriction digests of pNW33 were set up by combining the 

following: 



20 



Matrix #7 



450 pi 


22 ug/ml pNW33 in 1 x NEB buffer #2 + 1 00 ug/ml BSA 


50 |il 


0.6 U/ul BamHI in 1x NEB buffer #2 + 100 ug/ml BSA 


50 ul 


0.6 U/ul Hindlll in 1x NEB buffer #2 + 100 ug/ml BSA 


50 ul 


0.6 U/ul Aflll in 1x NEB buffer #2 + 100 ug/ml BSA 



Matrix #17 



450 pi 


22 ug/ml pNW33 in 1 x NEB buffer #2 + 1 00 ug/ml BSA 


50 ul 


0.6 U/ul Hindlll in 1x NEB buffer #2 + 1 00 ug/ml BSA 


50 ul 


0.6 U/fil Ncol in 1x NEB buffer #2 + 100 ug/ml BSA 


50 ul 


0.6 U/ul Spel in 1x NEB buffer #2 + 100 ug/ml BSA 
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The samples were then vortex mixed, and incubated at 37°C 

overnight. 

Calf intestinal alkaline phosphatase (CIAP) digestion of pNW33 
5 restriction digests 

400 ul fractions of the 20 restriction digests (each containing 
6.6 |ag of digested pNW33) were ethanol precipitated by the addition of 1 
of See DNA, 0.1 volume of 3 M sodium acetate (pH 5.2), and 2.5 volumes 
of ethanol, chilling to -20°C, and then centrifugation. Pellets were rinsed 
io with 70 % ethanol prior to re-dissolution in 20 \x\ of 1x CIAP buffer 

containing 40 U of CIAP. The CIAP digests were carried out for 5 hours at 
37°C and were then made up to 400 t ul with 1 x TE buffer. 

Phenol extraction of pNW33 CIAP digests 

I5 The diluted digests were extracted with 400 pi of phenol and 

then ethanol precipitated as described above but with a 100 % ethanol 
wash after the 70 % ethanol wash. Samples were finally re-dissolved in 
20 pi of TE buffer. 

:o Annealing of short PCR primers to cognate adaptors 

Short PCR primers and their cognate adaptors were annealed 
by adding 1 pi of 200 pM short PCR primer to 1 pi of 200 pM cognate 
adaptor in 20 pi of 50 mM NaCI, 1x TE buffer. The mixed oligonucleotides 
were overlaid with 30 pi of light mineral oil and were then heated to 90°C 
25 for 5 minutes followed by slow cooling to room temperature. The annealed 
short PCR primer / cognate adaptor complexes were then diluted with 1 ml 
of 1 x TE buffer and stored frozen at -20°C. 

Ligation to annealed short PCR primers and cognate adaptors 

30 1 pi of phenol extracted pNW33 digest was used per ligation. 

Ligations were performed in 100 pi of 1x ligase buffer containing 1 mM ATP 
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and 10 ul (100 U) of T4 DNA ligase. 20 pi aliquots from the 1 ml of 
annealed short PCR primer / cognate adaptor complexes were added 
according to the following table. 



#7 


BamHI / HindliU Aflll 


#17 


HindliU Ncoil Spel 



Ligation reactions were carried out for 24 hours at 16°C. 
Samples were then diluted to 1 ml with TE buffer and stored at -20°C. 

The diluted ligation reactions were then further diluted 1 in 10 
and 10 pi was used as PCR template per 100 pi reaction. 

ID 

Restriction digestion of human placental DNA 

1 U (-50 pg) of human placental DNA was digested overnight 
at 37°C with 100 U each of BamHI, BsrGI, Hindlll, Ncol, Spel, and Aflll in 
400 pi of 1x NEB buffer #2 containing 100 pg/ml BSA. 

I5 

CIAP digestion of human placental DNA restriction digest 

The digest was ethanol precipitated by the addition of 1 jil of 
See DNA. 0.1 volume of 3 M sodium acetate (pH 5.2), and 2.5 volumes of 
ethanol, chilling to -20°C, and then centrifugation. The pellet was rinsed 
:o with 70 % ethanol prior to re-dissolution in 50 pi of 1x CIAP buffer 

containing 100 U of CIAP. The CIAP digest was carried out for 5 hours at 
37°C and was then made up to 400 pi with 1x TE buffer. 

Phenol extraction of human placental DNA CIAP digest 

25 The diluted CIAP digest was extracted with 400 pi of phenol 

and then ethanol precipitated as described above - again with a 100 % 
ethanol wash after the 70 % ethanol wash. The sample was finally re- 
dissolved in 10 pi of TE buffer. 
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Ligation to annealed short PCR primers and cognate adaptors 

Short PCR primers were annealed to their cognate adaptors 

as described above. 

The ligation to annealed short PCR primers and cognate 
« adaptors was carried out in 100 pi of 1x ligase buffer with 1 mM ATP, 100 
U of T4 DNA ligase and 10 pi of each of the six short PCR primer / cognate 

adaptor complexes as above. 

The ligation reaction was carried out for 24 hours at 1 6°C. 
The sample was then diluted to 1 ml with 1x TE buffer and stored at -20°C. 
I0 0.2 pi was used as PCR template per 1 00 pi reaction. 

PCR amplification conditions 

An initial touch-down reaction was carried out in 50 pi of 1x 
PCR buffer with all four dNTPs at 200 pM and Taq DNA polymerase at 
l5 0.05 U/pl. Long PCR primers were used at 400 nM. 10 pi of pNW33 PCR 
template was used per reaction and 0.2 pi of human placental DNA PCR 
template was used per reaction. The samples were overlaid with 40 pi of 
light mineral oil and were touch-down thermocycled as described below: 

20 98°C for 1 min, 72°C for 2 min, 72°C for 5 min 
98°C for 1 min, 69°C for 2 min, 72°C for 5 min 
98°C for 1 min, 66°C for 2 min, 72°C for 5 min 
98°C for 1 min, 63°C for 2 min, 72°C for 5 min 
98°C for 1 min, 60°C for 2 min, 72°C for 5 min 

The main thermal cycling was then carried out in 100 jil of 1x 
PCR buffer with all four dNTPs at 200 pM and Taq DNA polymerase at 
0.05 U/pl. Long PCR primers were used at 400 nM. The samples were 
subjected to 20 cycles of 98°C for 1 min, 60°C for 2 min, 72°C for 5 min 
and then 72°C for 10 min followed by chilling at 10°C and recovery from 
under oil. 



25 



30 
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Arraying onto nylon membranes 

1 ul aliquots from the PCR amplified samples were spotted 
onto Hybond N + nylon membranes. The membranes were then transferred 
to a stack of three sheets of 3MM paper, saturated with 0.4 M NaOH, and 
incubated for 10 minutes. The NaOH was rinsed away in 2x SSC and the 
membranes were used directly for the pre-hybridization. 



Probe synthesis 

A PCR master mix was prepared as follows (all volumes 

in \x\): 



Master Mix Preparation j 140 


Water 


183.6 


10x PCR buffer 


30 


2 mM dATP 


15 


2 mM dGTP 


15 


2 mM dTTP 


15 


2 mM dCTP 


3 


140 probe template 


1 


Luc140up-bio 


2.5 


Luc140down 


2.5 


Taq DNA polymerase (5 U/^l) 


2.4 


Total 


270 



Five PCR reactions were performed. For each reaction, 5 uJ 
15 of 33P-labelled dCTP was added to the PCR tube. The reactions were 
then made up to 50 ul by the addition of 45 ul of the master mix to each 
tube. Each reaction was gently mixed. 
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PCR cycling parameters 

94 C C 2 min 

50 °C 2 min 

5 

72 C C 2 min -\ 

g4 o C - 45sec j 25 cycles 

50 °C 1 min 

72°C 8 min 

4°C hold 

io After thermal cycling, the five PCR amplifications were 

pooled. The 250 yl of labelled DNA was then used for the following 
magnetic bead purification procedure. 

Capture of biotinylated PCR product and release of non-biotinylated 
15 strands 

All separations were carried out using a Dynal MPC-4 
separator (Dynal, product #1 2004). 

250 (il of the pooled PCR was mixed with an equal volume of 
10 mg/ml streptavidin-coated colloidal Fe304 particles in 20 mM tris-HCI 
20 (pH 7.4), 2 mM EDTA (pH 8.0), 2 M NaCI. The tube was incubated at room 
temperature for 1 hour with mixing on a Denley Orbital Mixer. 

The streptavidin-coated colloidal Fe304 particles were then 
washed with 1 ml of 10 mM tris-HCI (pH 7.4), 1 mM EDTA (pH 8.0), 1 M 
NaCI. Three more identical washes were performed. 
25 The washed streptavidin-coated colloidal Fe304 particles 

were incubated in 500 jil of 0.1 M NaOH for 10 minutes at room 
temperature. The supernatant was removed and added to 500 y] of 0.5 M 
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HEPES. The samples were then. ethanol precipitated by the addition of 0.1 
volume of 3 M sodium acetate (pH 5.2) and 2.5 volumes of ethanol. chilling 
to 0 C C (on ice for 30 minutes), and then centrifugation at 20,000 maxRCF 
for 10 minutes. The pellets were rinsed with 70 % ethanol before 
5 dissolving in 100 \x\ of TE buffer. 

Hybridization 

Each membrane was placed in a 55 mm x 35 mm x 21 mm 
plastic box and 1.25 ml of pre-hybridization solution (5x SSC; Denhardt's 

in solution; 1 % SDS; 10 % dextran sulphate [Mw 500,000]; 0.3 % 

tetrasodium pyrophosphate; 100 jag/ml denatured, sonicated DNA - pre- 
warmed to 65°C) was added. Each box was closed and incubated at 65°C 
for 50 minutes on a rocking platform. The pre-hybridization solution was 
removed and replaced with hybridization solution (5x SSC; Denhardt's 

is solution; 1 % SDS; 10 % dextran sulphate [Mw 500,000]; 0.3 % 
tetrasodium pyrophosphate; 100 jig/ml denatured, sonicated DNA - 
containing 5 jliI of the appropriate 33P-labelled probe) and the box was 
incubated at 65°C for 3 hours on a rocking platform. 

20 Washing 

The membranes were drained and transferred to 200 ml of 2x 
SSC, 0.1 % SDS at 68°C for 30 minutes. A further wash was carried out in 
0.2x SSC, 0.1 % SDS at 71 °C for 30 minutes. The membranes were 
rinsed in 2x SSC at room temperature and laid out on blotting paper to 
25 remove excess liquid. Once dry, the membrane was covered in Saran 

Wrap and exposed to a Kodak Phosphor Screen for 1 hour. The phosphor 
screen was subsequently imaged using a Molecular Dynamics Storm 860 
Phosphorimager. 

Results are shown in figures 4 and 5. 

30 
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Example #3a 

A single cycle of inter-population perfectly matched duplex depletion 
wherein E.coli MutS protein is used to capture an A/A mismatch- 
5 containing duplex 

'Affected' versus 'unaffected' (i.e. inter-population) mismatch- 
containing duplex selection can be achieved by: attaching a mismatch- 
binding protein to a solid support (or using the mismatch-binding protein in 
solution followed by subsequent solid-phase capture); taking denatured 

it) 'affected' DNA.fragments and hybridizing these to denatured and 
biotinytated 'unaffected* DNA fragments; and capture of mismatch- 
containing duplex molecules with the mismatch-binding protein. Releasing 
the mismatch-containing duplex molecules (without strand denaturation), 
streptavidin capture and then release of the non-biotinylated strands will 

15 give only the desired species. 

In this example, PCR fragments are prepared and used to 
demonstrate each of the individual steps for a single cycle of inter- 
population perfectly matched duplex depletion using E.coli MutS protein. 

20 Clone insert design 

The clone inserts were constructed using standard cloning 
methodology well known to those skilled in the art and were inserted 
between the Aval site and EcoRI of pMOSS/ue (Amersham Pharmacia 
Biotech). 

25 The clone inserts contain a common 9 base pair internal core 

sequence in which a single nucleotide change or an insertion can be 
located. The internal core sequence is derived from codons 272-274 of 
human p53. These codons (GTG CGT GGT) correspond to a mutational 
hotspot found in lung and other types of cancer (R273L). For the design of 

30 the clone inserts containing a mismatch (only one from a complete series of 
which is shown below), the nucleotide in position 5 of this core sequence is 



WO 00/55364 



PCT/GB00/00916 



- 138- 

modified (GTGCXTGGT). 

The internal core sequence is flanked by a random sequence 
- allowing the independent detection of the clone #1 and the clone #7 
insert sequence in a mixed population of clone inserts. 

5 

Clone #1 

Mutant sequence (#1M) 

1 0 5 ' CCCGGGGGATCCTCGTTTTATTGGGCCGAGTTTTGGTCCGTAGTGCTTGGTTAGATATGCTTAT 
3 ' GGGCCCCCTAGGAGCAAAATAACCCGGCTCAAAACCAGGCATCACGAACCAATCTATACGAATA 

GTTCACAAAATC ATCCTTGTAC AGAATTC 3 ' 
C AAGTGTTTTAGTAGGAAC ATGTCTTAAG 5 ' 

15 

Control sequence (#1C) 

5 ' CCCGGGGGATCCTCGTTTTATTGGGCCGAGTTTTGGTCCGTAGTGCATGGTTAGATATGCTTAT 
3 ' GGGCCCCCTAGGAGCAAAATAACCCGGCTCAAAACCAGGCATCACGTACCAATCTATACGAATA 

20 

GTTCACAAAATCATCCTTGTAC AGAATTC 3 ' 
CAAGTGTTTTAGTAGGAACATGTCTTAAG5 ' 

Clone #7 

25 

Control sequence (#7C) 

5 ' CCCGGGTGTACACAAAAGTTTACCTGAAGAACGTGGGGGGTCGTGCCTGGTCTTGCGTCACCTG 
3 ' GGGCCCACATGTGTTTTCAAATGGACTTCTTGCACCCCCCAGCACGGACCAGAACGCAGTGGAC 

30 

GTCTCAGGAGAGGGTCCCCATGGGAATTC3 ' 
CAGAGTCCTCTCCCAGGGGTACCCTTAAG5 ' 
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Preparation of upper strand 5'-biotinylated and lower strand 5'- 
biotinylated #1C double-stranded PCR product, and upper strand 5' 
biotinylated and lower strand 5'-biotinylated #7C double-stranded 
PCR product 

5 

Oligonucleotides 

BIOUPST2 5' bio-CTACTGATCGGATCCCCG 3" 

BIODOWN3 5' bio-AAACGACGGCCAGTGAAT 3' 

io PCR reaction set-up (#1C) 



#1C in pMOSS/ue at 2.50 ug/ml 
Water 

10x PCR buffer 
15 50 uM BIOUPST2 
50 uM BIODOWN3 
10 mM dNTPs 

Taq DNA polymerase (5 U/ul) 



100 ul 
8400 ul 
1000 ul 
100 u.l 
100 ul 
200uJ 
100 ul 



PCR reaction set-up (#7C) 



#7C in pMOSe/ue at 3.32 ug/ml 
Water 

1 0x PCR buffer 
50 uM BIOUPST2 
50 uM BIODOWN3 
10 mM dNTPs 

Taq DNA polymerase (5 U/pil) 



100 |il 
8400 ul 
1000 ul 
100 ul 
100 ul 
200ul 
100 ul 



30 
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96x 200 reactions were carried out for template #1C and 
96x 200 ul reactions were carried out for template #7C on a 96-well Perkin 
Elmer Cetus GeneAmp PCR System 9600 machine as described below: 



30 cycles 



95 °C, 5 minutes 1 cycle 
95 °C. 1 minute 
50 °C, 1 minute 
72 °C ; 1 minute 
72 °C, 5 minutes 1 cycle 
4°C. hold 



The PCR products were pooled together and precipitated by 
adding 0.1 volumes of 3 M sodium acetate and 1 volume of isopropanol 
followed by centrifugation at 16,000 rpm for 30 minutes at 4°C (in a 
Centrikon T-2070 ultracentrifuge; swinging bucket Kontron rotor TST 

io 41.14). Pellets were washed with 14 ml of ethanol and centrifuged at 
20,000 rpm for 30 minutes. Finally, the pellets were air-dried and 
resuspended in a total volume of 0.6 ml of TE buffer. 

The 0.6 ml PCR sample was purified in twelve Microspin S- 
300 HR columns (50 pi per column) following the manufacturer's protocol. 

15 Briefly, the resin in the columns was resuspended by vortexing. Columns 
were centrifuged at 735 x g (3000 rpm in a microfuge) for 1 minute. The 
sample was then applied to the centre of the resin, being careful not to 
disturb the bed. The columns were centrifuged at 735 x g for 2 minutes 
and the flow-through containing the PCR product was collected. The 

:o twelve eluted 50 (jl volumes were pooled together (pool 1). Columns were 
washed with 50 \i\ of TE buffer and the eluted fractions were loaded onto a 
fresh S-300 HR column. Product yield and removal efficiency of the PCR 
primers were analysed on a 1 .5 % agarose gel. 
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Preparation of non-biotinylated #1M single-stranded DNA and non- 
biotinyiated #7C single-stranded DNA 



Oligonucleotides 

BIOUPST2 5' b i o-CT ACTG ATC G G ATC C CC G 3' 

DOWN3 5' AAACGACGGCCAGTGAAT 3' 

PCR reaction set-up (#1M) 

io 

#1M in pMOSe/ue at 3.56 ug/ml 
Water 

1 0x PCR buffer 
50 uM BIOUPST2 
15 50uMDOWN3 
10 mM dNTPs 

Tag DNA polymerase (5 U/jul) 



100 jul 
8400 nl 
1000 ul 
100 p.t 
100 nl 
200ul 
100 nl 



20 



PCR reaction set-up (#7C) 

#7C in pMOSBlue at 3.32 ug/ml 100 

Water 8400 ul 

10x PCR buffer 1000ul 

50 uM BIOUPST2 100ul 

50uMDOWN3 100 ul 

10 mM dNTPs 200ul 

Taq DNA polymerase (5 U/ul) 1 00 ul 



30 



48x 200 hI reactions were carried out for template #1 M and 
48x 200 ul reactions were carried out for template #7C on a 96-well Perkin 
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Elmer Cetus GeneAmp PCR System 9600 machine as described below: 

95 °C, 5 minutes 1 cycle 
95°C ? 1 minute 
50 °C, 1 minute 30 cycles 
72 °C, 1 minute 
72 °C, 5 minutes 1 cycle 
4°C, hold 

Capture of biotinylated PCR product strands and release of non- 
5 biotinylated strands for the preparation of non-biotinylated #1 M 
single-stranded DNA and non-biotinylated #7C single-stranded DNA 

10 ml of pooled #1M and #7C PCR amplifications were each 
mixed with an equal volume of 4 mg/ml streptavidin-coated colloidal Fe 3 0 4 
particles taken up in 20 mM tris-HCI (pH 7.4), 2 mM EDTA (pH 8.0), 2 M 
io NaCI. The tubes were incubated at room temperature for 60 minutes with 
mixing. 

The streptavidin-coated colloidal Fe 3 0 4 particles were then 
washed with 20 ml of 10 mM tris-HCI (pH 7.4), 1 mM EDTA (pH 8.0), 1 M 
NaCI. Two more identical washes were performed. 
1 5 The washed streptavidin-coated colloidal Fe 3 0 4 particles 

were finally incubated in 800 |al of 0.1 M NaOH for 10 minutes at room 
temperature. The supernatants were removed and added to 200 jil of 2 M 
HEPES (free acid). Samples were quantified by absorbance at 260 nm. 

20 Denaturation and annealing 

Denaturation and annealing reactions were prepared by 

mixing: 



25 
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A = 400 ng of upper strand S'-biotinylated and lower strand 5'- 
biotinylated #1C double-stranded PCR product 
B = 400 ng of upper strand S'-biotinylated and lower strand S'- 
biotinylated #7C double-stranded PCR product 
C = 200 ng of non-biotinylated #1 M single-stranded DNA 
(the lower of the two #1 M strands shown above) . 
D = 200 ng of non-biotinylated #7C single-stranded DNA 
(the lower of the two #7C strands shown above) 

Reannealing between the upper strand of A and the single- 
stranded C will therefore give rise to an A/A mismatch-containing duplex, 
S'-biotinylated on the upper strand, for clone insert #1 . 

5 Reannealing between the upper strand of B and the single- 

stranded D will therefore give rise to a perfectly matched duplex, 
S'-biotinylated on the upper strand, for clone insert #7. 

0.1 volumes of 1 M NaOH were then added, followed by 
incubation at room temperature for 10 minutes. 0.25 volumes of 2 M 

to HEPES (free acid) were finally added followed by incubation at 42°C for 
1 hour. 

Samples were adjusted to 50 jil and were made 1x in PBS 
and 1 mg/ml in BSA ready for reaction with MutS protein-coated magnetic 
beads. 

I5 One 50 pi sample (the pre-enrichment control, sample 6) was 

used directly for capture of biotinylated PCR product strands and release of 
non-biotinylated strands. 

Mismatch capture 

20 20 nl of M 2 B 2 MutS protein-coated magnetic particles 

(Genecheck, lot #20) were added to the annealed DNA above. Samples 
were incubated for 1 hour at room temperature with shaking. 

Samples were then washed twice with 200 (il of ice-cold PBS. 
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Samples were finally eluted from the magnetic beads for 10 
minutes at room temperature in 50 ul of the following: 



Sample 1 


1 M NaCI 


Sample 2 


PBS 


Sample 3 


1 M urea 


Sample 4 


1% (w/v) SDS 


Sample 5 


10 mM NaOH 



5 Capture of biotinylated PCR product strands and release of non- 
biotinyiated strands 

All separations were carried out using an Amersham 
magnetic separator (RPN1682, batch #1). 

50 |il of the eluates from the MutS protein-coated magnetic 
io beads and the pre-enrichment control (sample 6) were each mixed with an 
equal volume of 4 mg/ml streptavidin-coated colloidal Fe 3 0 4 particles in 20 
mM tris-HCI (pH 7.4), 2 mM EDTA (pH 8.0), 2 M NaCI. The tubes were 
incubated at room temperature for 30 minutes with regular mixing. 

The streptavidin-coated colloidal Fe 3 0 4 particles were then 
is washed twice with 500 jul of 10 mM tris-HCI (pH 7.4), 1 mM EDTA (pH 8.0), 
1 M NaCI at room temperature. 

The washed streptavidin-coated colloidal Fe 3 0 4 particles 
were incubated in 10 jil of 0.1 M NaOH for 10 minutes at room 
temperature. The supernatant was removed and added to 2.5 [l\ of 2 M 
20 HEPES (free acid). 

5 (il fractions were finally spotted onto Hybond N+ nylon 
membranes along with 200 ng, 20 ng, 2 ng, and 200 pg amounts of the 
following: 
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C = non-biotinylated #1M singie-stranded DNA 
(the lower of the two #1 M strands shown previously) 
D = non-biotinytated #7C single-stranded DNA 
(the lower of the two #7C strands shown previously) 



Probe labelling 
Oligonucleotides 

5 

#1 probe oligo 5 1 GGCCGAGTTTTGGTCCGTAG 3' 
#7 probe oligo 5' GTCTTGCGTCACCTGGTCTCAG 3 1 

io Preparation of probes 

#1 probe oligo and #7 probe oligo were radioactively 5' end- 
labelled using T4 polynucleotide kinase as described below (all volumes 
are in \x\)\ 





#1 probe 


#7 probe 


#1 probe oligo (25 pmol) 


1.25 




#7 probe oligo (25 pmol) 




1.25 


10x PNK buffer 


2.5 


2.5 


[y- 33 P] ATP >92 TBq/mmol, >2500 Ci/mmol, 370 
mBq/ml, 1 0 mCi/ml, 

(Amersham Pharmacia Biotech., AH9968, lot #B0006) 


12.5 


12.5 


T4PNK(10U/ul) 

(Amersham International, E70031Y, lot #201226) 


2.5 


2.5 


Water 


6.25 


6.25 



15 

The reactions were incubated at 37°C for 30 minutes and 
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then heated to 70°C for 5 minutes to denature the enzyme. 

MicroSpin G-25 column purification 

Two G-25 columns (APB 27-5325-01, lot #9015325011) were 
5 resuspended by vortexing, and the bottom closures were snapped off as 
described in the manufacturers instructions. A pre-spin of the columns 
was carried out for 1 minute at 730 maxRCF - 2670 rpm in a Hettich 
Zeutrifugen EBA 12 benchtop centrifuge - and the eluates were discarded. 
The 25 fit reactions were applied to each column, and the columns were 
lo centrifuged for 2 minutes at 730 maxRCF. The eluates from the second 
spin were stored and used as probes. 

Hybridization 

Each membrane was placed in a 55 mm x 35 mm x 21 mm 
1 5 plastic box and 2.5 ml of pre-hybridization solution (5x SSC; Denhardt's 
solution; 1 % SDS; 10 % dextran sulphate [MW 500,000]; 0.3 % 
tetrasodium pyrophosphate; 100 (ig/ml denatured, sonicated DNA - pre- 
warmed to 42°C) was added. Each box was closed and incubated at 42°C 
for 1 hour on a rocking platform. The pre-hybridization solution was 
:o removed and replaced with hybridization solution (5x SSC; Denhardt's 
solution; 1 % SDS; 10 % dextran sulphate [MW 500,000]; 0.3 % 
tetrasodium pyrophosphate; 100 ng/ml denatured, sonicated DNA - 
containing 2.5 |il of the appropriate ^P-labelled probe) and the box was 
incubated at 42°C overnight on a rocking platform. 

25 

Washing 

The membranes were drained and transferred to 200 ml of 2x 
SSC, 0.1 % SDS at 42°C for 1 0 minutes. A further wash was carried out in 
0.2x SSC, 0.1 % SDS at 42°C for 10 minutes. The membranes were 
30 rinsed in 2x SSC at room temperature and laid out on blotting paper to 
remove excess liquid. Once dry, the membrane was covered in Saran 



WO 00/55364 



PCT/GBOO/00916 



-147- 

Wrap and exposed to a Kodak Phosphor Screen for 1 hour. The phosphor 
screen was subsequently imaged using a Molecular Dynamics Storm 860 
Phosphorimager. 



5 Dot blot layout 



Sample 1 
5 jil fraction 


Sample 2 
5 ul fraction 


Sampie 3 
5 |ii fraction 


Sample 4 
5 ul fraction 


Sample 5 
5 jii fraction 


Sample 6 
5 ul fraction 


200 ng 

#1 M ssDNA 


20 ng 

#1MssDNA 


2 ng 

#1M ssDNA 


200 pg 
#1M ssDNA 






200 ng 
#7C ssDNA 


20 ng 

#7C ssDNA 


2 ng 

#7C ssDNA 


200 pg 
#7C ssDNA 







ssDNA = single-stranded DNA 
io Probe hybridisation results 



#1 probe 


#7 probe 
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Signal intensities from the spots above were reported using 
ImageQuant 5.0 software (Molecular Dynamics), with the SumAboveBG 
15 figures being used after drawing a 6x3 grid over the array of spots. 
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#1 probe SumAboveBG signals 



Sample i 
142257 


Sample 2 
24356 


Sample 3 
180359 


Sample 4 
394296 


Sample 5 
465242 


Sample 6 
658323 


200 ng 
#1M ssDNA 
15961011 


20 ng 
#1M ssDNA 
2842937 


2 ng 
#1M ssDNA 
252392 


200 pg 
#1M ssDNA 
25316 


2281 


2475 


200 ng 
#7C ssDNA 
66536 


20 ng 
#7C ssDNA 
37955 


2 ng 
#7C ssDNA 
10621 


200 pg 
#7C ssDNA 
2736 


1955 


1934 


#7 probe SumAboveBG signals 


Sample 1 
66392 


Sample 2 
17071 


Sample 3 
118109 


Sample 4 
153372 


Sample 5 
166211 


Sample 6 
666853 


200 ng 
#1M ssDNA 
80873 


20 ng 
#1M ssDNA 
23256 


2 ng 
#1MssDNA 
6371 


200 pg 
#1M ssDNA 
2399 


1831 


1857 


200 ng 
#7C ssDNA 
10619299 


20 ng 
#7C ssDNA 
2052027 


2 ng 
#7C ssDNA 
207827 


200 pg 
#7C ssDNA 
15470 


2055 


1844 



Recovery 



Elution conditions 


#1 signal 
(% input ssDNA) 


#7 signal 
(% input ssDNA) 


#1/#7 signal ratio 


1 M NaCI 


21.6% 


10.0% 


2.2 


PBS 


3.7 % 


2.6 % 


1.4 


1 M urea 


26.4 % 


17.7% 


1.5 


1 % SDS 


59.9 % 


23.0 % 


2.6 


10 mM NaOH 


70 .7 % 


24.9 % 


2.8 



Recovery figures are plotted below for the various different 
elution conditions: 



WO 00/55364 



PCT/GB00/00916 



- 149 - 



100 




5 Example #3b 

A single cycle of inter-population perfectly matched duplex depletion 
wherein bacteriophage T4 endonuclease VIII protein containing a 
cleavage-inactivating N62D point mutation is used to capture an A/A 
io mismatch-containing duplex 

In this example, PCR fragments are again prepared and used 
to demonstrate each of the individual steps for a single cycle of inter- 
population perfectly matched duplex depletion using bacteriophage T4 
endonuclease VIII protein containing a cleavage-inactivating N62D point 
15 mutation. 

Clone design and DNA preparation 

Clone insert design was exactly as described in example #3a 
(see clone #1 : mutant sequence (#1 M), control sequence (#1C) and clone 
20 #7: control sequence (#7C)). 
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Preparation of upper strand S'-biotinylated and lower strand 
S'-biotinylated #1C double-stranded PGR product, and upper strand S'- 
biotinylated and lower strand S'-biotinylated #7C double-stranded PCR 
product were as described in example #3a. 
5 Preparation of non-biotinylated #1 M single-stranded DNA and 

non-biotinylated #7C single-stranded DNA were again as described in 
example #3a. 

Denaturation and annealing 

io Denaturation and annealing reactions were prepared as 

described in example #3a. 

Reannealing will therefore again give rise to an A/A 
mismatch-containing duplex, S'-biotinylated on the upper strand, for clone 
insert #1 and a perfectly matched duplex, S'-biotinylated on the upper 
15 strand, for clone insert #7. 

One 50 jil sample (the pre-enrichment control) was adjusted 
to 1 00 jil with TE buffer and was used directly for capture of biotinylated 
PCR product strands and release of non-biotinylated strands. 

20 Mismatch capture 

50 \i\ samples of the annealing reaction were mixed with an 
equal volume of 200 mM sodium phosphate (pH 6.5), 100 mM KCI. 10 fig 
of GST-tagged T4 endonuclease VII N62D mutant (obtained from Prof. 
Borries Kemper, Univ. Cologne) were added to the annealed DNA and the 
25 samples were incubated for 15 minutes at 16°C. 

Samples were then mixed with 200 jil of a 50 % slurry of 
Glutathione Sepharose 4B (Amersham Pharmacia Biotech, lot #279991) in 
100 mM sodium phosphate (pH 6.5), 50 mM KCI. The tubes were 
incubated at 16°C for 30 minutes with regular mixing. The mixture was 
.10 then transferred to a spin column to separate solid from liquid phases. 

Samples were finally eluted from the Glutathione Sepharose 
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4B matrix for 10 minutes at room temperature in 100 ul of 10 mM reduced 
glutathione in 50 mM tris-HCI (pH 8.0). 

Capture of biotinylated PCR product strands and release of non- 
5 biotinylated strands 

All separations were carried out using an Amersham 
magnetic separator (RPN1682, batch #1). 

100 \x\ of the eluate from the N62D T4 endonuclease VII 
mismatch-capture reaction, and the pre-enrichment control were each 
io mixed with an equal volume of 4 mg/ml streptavidin-coated colloidal Fe 3 0 4 
particles in 20 mM tris-HCI (pH 7.4), 2 mM EDTA (pH 8.0), 2 M NaCI. The 
tubes were incubated at room temperature for 30 minutes with regular 
mixing. 

The streptavidin-coated colloidal Fe 3 0 4 particles were then 
15 washed twice with 500 \i\ of 10 mM tris-HCI (pH 7.4), 1 mM EDTA (pH 8.0), 
1 M NaCI at room temperature. 

The washed streptavidin-coated colloidal Fe 3 0 4 particles 
were incubated in 10 \il of 0.1 M NaOH for 10 minutes at room 
temperature. The supernatant was removed and added to 2.5 ^l of 2 M 
20 HEPES (free acid). 

5 jil fractions were finally spotted onto Hybond N+ nylon 
membranes along with 200 ng, 20 ng, 2 ng, and 200 pg amounts of the 
following: 

C = non-biotinylated #1M single-stranded DNA 
(the lower of the two #1 M strands shown previously) 
D = non-biotinylated #7C single-stranded DNA 
(the lower of the two #7C strands shown previously) 

25 

Probe labelling 

Probe labelling was exactly as described in example #3a. 
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Hybridization 

Each membrane was placed in a 55 mm x 35 mm x 21 mm 
plastic box and 2.5 ml of pre-hybridization solution (5x SSC; Denhardt's 

5 solution: 1 % SDS; 10 % dextran sulphate [MW 500,000]; 0.3 % 

tetrasodium pyrophosphate; 100|ig/ml denatured, sonicated DNA - pre- 
warmed to 42°C) was added. Each box was closed and incubated at 42°C 
for 10 minutes on a rocking platform. The pre-hybridization solution was 
removed and replaced with hybridization solution (5x SSC; Denhardt's 

io solution; 1 % SDS; 1 0 % dextran sulphate [MW 500,000]; 0.3 % 
tetrasodium pyrophosphate; 100|ig/ml denatured, sonicated DNA - 
containing 2.5 [x\ of the appropriate 33 P-labelled probe) and the box was 
incubated at 42°C overnight on a rocking platform. 

15 Washing 

The membranes were drained and transferred to 200 ml of 2x 
SSC, 0.1 % SDS at 42°C for 10 minutes. A further wash was carried out in 
0.2x SSC, 0.1 % SDS at 42°C for 10 minutes. The membranes were 
rinsed in 2x SSC at room temperature and laid out on blotting paper to 

20 remove excess liquid. Once dry, the membrane was covered in Saran 

Wrap and exposed to a Kodak Phosphor Screen for 1 hour. The phosphor 
screen was subsequently imaged using a Molecular Dynamics Storm 860 
Phosphorimager. 

Signal intensities from the spots above were again reported 

25 using ImageQuant 5.0 software (Molecular Dynamics), with the 

SumAboveBG figures being used after drawing a grid over the array of 
spots. 
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#1 probe SumAbov BG signals 



200 ng 
#1M ssDNA 
22418970 


20 ng 
#1M ssDNA 
3317389 


2ng 
#1M ssDNA 
270664 


200 pg 
#1M ssDNA 
25763 


GSH eluted 
sample 
144398 


200 ng 
#7C ssDNA 
229951 


20 ng 
#7C ssDNA 
105343 


2 ng 
#7C ssDNA 
16354 


200 pg 
#7C ssDNA 
3632 


Pre- 
enrichment 
control 
997077 


#7 probe SumAboveBG signals 


200 ng 
#1M ssDNA 
117267 


20 ng 
#1 M ssDNA 
42226 


2ng 
#1M ssDNA 
9421 


200 pg 
#1M ssDNA 
2812 


GSH eluted 
sample 
1 1 3608 


200 ng 
#7C ssDNA 
3665967 


20 ng 
#7C ssDNA 
22248096 


2ng 
#7C ssDNA 
203869 


200 pg 
#7C ssDNA 
19406 


Pre- 
enrichment 
control 
1135786 



Recovery 



Elution conditions 


#1 signal 
(% input ssDNA) 


#7 signal 
(% input ssDNA) 


#1/#7 signal ratio 


10 mM reduced 
glutathione in 50 mM 
tris-HCl (pH 8.0) 


14.5% 


10.0% 


1.45 



io The enrichment results are presented graphically below: 
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io Dedicated to the memory of Chris Griffin and Richard Beer 



